Apache Spark & Big Data Evolution | Kaggle NYC
Explore the history and growth of big data processing, focusing on Apache Spark and its ecosystem components.

Biased Outliers
52 views • Jan 30, 2020

About this video
In this talk, we will discuss the history of big data processing that led to the development of Spark. We will also cover the different components of Spark and their complementary open source projects. Finally, we will talk about cutting-edge big data tools used in production today by startups and enterprises.
Biography:
Gary Cheung - Cloud Data Architect @ Blackboard Insurance
Gary is currently a Cloud Data Architect at Blackboard Insurance, where he evaluates different tools and approaches for building out a robust data ecosystem. Prior to Blackboard, Gary was a Big Data Consultant at Caserta and the first data engineer hire at Capsule Pharmacy. He has worked on big data problems across multiple industries, including Investment Banking, Ad Tech, Healthcare and Insurance. Gary also holds a Master's in Biotechnology and Bioinformatics from the University of Pennsylvania.
Slides are in our Slack: https://kagglenyc.com
Thanks to our sponsors:
https://getlitt.app
https://harrymoreno.com/hire-me
Video Information
Views: 52
Duration: 01:01:56
Published: Jan 30, 2020