Advanced Apache Spark Training - Sameer Farooqui (Databricks)

 Sparkwinds
3

Video Details

Live Big Data Training from Spark Summit 2015 in New York City.

"Today I'll cover Spark core in depth and get you prepared to use Spark in your own prototypes. We'll start by learning about the big data ecosystem, then jump into RDDs (Resilient Distributed Datasets). Then we'll talk about integrating Spark with resource managers like YARN and Standalone mode. After a peek into some Spark Internals, we touch base upon Accumulators and Broadcast Variables. Finally, we end with Spark Streaming and a technical explanation of how the 100 TB sort competition was won in 2014." - Sameer

Slides:
https://spark-summit.org/wp-content/uploads/2015/03/SparkSummitEast2015-AdvDevOps-StudentSlides.pdf


Want to learn more about Spark?

Check out my new class, "Exploring Wikipedia with Apache Spark", recorded June 2016:
https://www.youtube.com/watch?v=vlVnSpJ6TDE&t=21m23s


// About the Presenter //
Sameer Farooqui is a Technology Evangelist at Databricks where he helps promote the adoption of Apache Spark. As a founding member of the training team, he created and taught advanced Spark classes at private clients, meetups and conferences globally.

Follow Sameer on -
Twitter: https://twitter.com/blueplastic
LinkedIn: https://www.linkedin.com/in/blueplastic

Date Added: 2020-12-18

Category: Sparkwinds

Watched 8 times

Tags: None

Loading...