Handling Fast Data with Apache Spark SQL and Streaming
Pluralsight
Course Summary
Apache Spark is a leader in enabling quick and efficient data processing. This course will teach you how to use Spark's SQL, Streaming, and even the newer Structured Streaming APIs to create applications able to handle data as it arrives.
-
+
Course Description
Analyzing data used to be something you did once a night. Now you need to be able to process data on the fly so you can provide up to the minute insights. But, how do you accomplish in real time what used to take hours without a complicated code base? In this course, Handling Fast Data with Apache Spark SQL and Streaming, you'll learn to use Apache Spark Streaming and SQL libraries as a great way to handle this new world of real time, fast data processing. First, you'll dive into SparkSQL. Next, you'll explore how to catch potential fraud by analyzing streams with Spark Streaming. Finally, you'll discover the newer Structured Streaming API. By the end of this course, you'll have a deeper understanding of these APIs, along with a number of streaming concepts that have driven the API design.
-
+
Course Syllabus
Course Overview- 2m 4s
—Course Overview 2m 4sIntroduction- 21m 49s
—Introduction 1m 52s
—Fast Data: Avoiding Lambda Architecture Complexity 5m 57s
—What to Expect 3m 57s
—Spark 2.x 7m 51s
—Resources 1m 15s
—Summary 0m 55sQuerying Data with the DataFrames (Part 1)- 43m 10sQuerying Data with the DataFrames (Part 2)- 41m 8sImproving Type Safety with Datasets- 41m 20sProcessing Data with the Streaming API- 1h 7mOptimizing, Structured Streaming, and Spark 2.x- 58m 3s