Talks

2022

Learn Open Source Tools & Frameworks on Uplimit

Open Source Startup Podcast, Sept 2022: Talk about building engaging learning experiences to grow open source adoption and creating true evangelists.

Technical and data training that feels like the real world

Summer Community Days, July 2022: Talk about designing engaging learning experiences for adults that feel high touch and closer to real jobs.

Every Engineer Should and Can Learn Machine Learning

KD Nuggets, June 2022: Interview about learning ML, breaking into the field and how to move from a software engineer to a ML engineer.

2020

Keynote Panel: State of AI in the Enterprise

Global AI Conference, October 2020: Panel discussion on challenges in building, deploying and productionizing models in an enterprise setting.

2018

High-Performance TensorFlow for Accelerators

Tensorflow Roadshow, Bengaluru, October 2018: Talk about the Tensor Processing Units (TPUs). Covers the internals of the hardware design of TPUs, how to run your Tensorflow models on accelerators and performance tuning for your models when using Accelerators.

Museum of Machine Arts

PyBay, August 2018: Talk about recent developments in Deep Learning generated/assisted art. Covering recent papers about GANs, Neural Style transfer, Pix2Pixel etc. (Slides)

Presented similar talk on machine generated art at: Global Big Data Conference, August 2018: (Slides)

Jeju Deep Learning Camp

Jeju Deep Learning Camp, July 2018: One of the mentors for the deep learning summer camp in Jeju, Korea. Mentored students on projects across variety of domains such as Reinforcement Learning, Recommendation Systems, Computer Vision and Natural Language Processing.

New APIs in Tensorflow

Global Artificial Intelligence Conference, April 2018: Talk about the Tensorflow Eager execution and the Dataset APIs. Covers on how to change your code from using graph mode to eager execution and in the second half covered the datasets api to write performant input pipelines. (Slides)

2017

Unified Batch and Stream processing with Apache Beam

PyData, July 2017: Apache Beam is a parallel programming model that allows one to implement batch and streaming data processing jobs that can run on a variety of execution engines like Apache Spark and Google Cloud Dataflow. (Slides | Video)

Presented similar talks on Apache Beam at:

PyBay, August 2017: (Slides | Video)
Cloud+Data Nextcon, September 2017: (Slides)
Florida Pycon, October 2017: (Slides)
All things Python, October 2017: (Slides)

Intro to Recommendation Systems

Metis, Auguest 2017: Guest lecture on building real world recommendation systems. Talk goes over building multiple iterations of the system in increasing complexity.

2016

Cassandra batch loading for building Data Products

Cassandra Meetup, July 2016: Talk about Nostos (batch loading service at Coursera) and some of its use cases in data products such as recommendations, search and prediction models. Covers some of the design choice and tradeoffs made in building Nostos and explains how the system evolved over time. (Slides | Video)

Extending our workflow service for use cases beyond ETL

Big Data Meetup, May 2016: Talk about Dataduct the workflow / ETL service at Coursera and how it is now being used for other use cases beyond just ETL such as machine learning, predictions and bulk loading into cassandra. (Slides)

2015

Large-Scale ETL Data Flows with AWS Data Pipeline & Dataduct

AWS Re:Invent, Oct 2015: Dive deep into AWS Data Pipeline and Dataduct, an open source framework built at Coursera to manage pipelines and create reusable patterns to expedite developer productivity. (Slides | Video)

Sourabh Bajaj