Session: Machine Learning with Apache Beam

Apache Beam is an open-source tool for building distributed data pipelines. This talk will explore how Beam can be used to perform common machine learning tasks such as inference, pre- and post-processing with pandas-like DataFrames, and some forms of training. The talk will have a heavy demo component, showing multiple examples of using Beam for ML. Attendees can expect to leave with a high-level understanding of Beam and the ability to use it to parallelize their ML workloads.

Session Speakers:

Danny McCormick

Danny is a senior software engineer at Google in Durham, NC, where he spends most of his time working on Apache Beam. Previously, Danny worked at GitHub, where he helped launch GitHub Actions.
