MyPage is a personalized page based on your interests.The page is customized to help you to find content that matters you the most.


I'm not curious

Beginning Data Exploration and Analysis with Apache Spark

Course Summary

80% of a data scientist's job is data preparation. This course is all about data preparation i.e. cleaning, transforming, summarizing data using Spark.


  • +

    Course Syllabus

    Course Overview
    - 1m 42s

    —Course Overview 1m 42s
    Getting Started with Spark's Resilient Distributed Datasets
    - 27m 11s

    —The Role of Spark in Data Analysis 6m 3s
    —Understanding the Components of Spark 4m 17s
    —Installing Spark Standalone in a Local Environment 4m 21s
    —Hello World: Loading a Data Set 3m 47s
    —Understanding Resilient Distributed Datasets 8m 41s
    Transforming and Cleaning Unstructured Data
    - 32m 1s
    Summarizing Data Along Dimensions
    - 30m 30s
    Modeling Relationships in the Marvel Social Universe
    - 25m 59s


Course Fee:
USD 29

Course Type:

Self-Study

Course Status:

Active

Workload:

1 - 4 hours / week

Attended this course?

Back to Top

Awards & Accolades for MyTechLogy
Winner of
REDHERRING
Top 100 Asia
Finalist at SiTF Awards 2014 under the category Best Social & Community Product
Finalist at HR Vendor of the Year 2015 Awards under the category Best Learning Management System
Finalist at HR Vendor of the Year 2015 Awards under the category Best Talent Management Software
Hidden Image Url

Back to Top