MyPage is a personalized page based on your interests.The page is customized to help you to find content that matters you the most.

I'm not curious

Mining Massive Datasets

Course Summary

This class teaches algorithms for extracting models and other information from very large amounts of data. The emphasis is on techniques that are efficient and that scale well.

  • +

    Course Syllabus

    Week 1:
    Link Analysis -- PageRank Week 2:
    Locality-Sensitive Hashing -- Basics + Applications
    Distance Measures
    Nearest Neighbors
    Frequent Itemsets Week 3:
    Data Stream Mining
    Analysis of Large Graphs Week 4:
    Recommender Systems
    Dimensionality Reduction Week 5:
    Computational Advertising Week 6:
    Support-Vector Machines
    Decision Trees
    MapReduce Algorithms Week 7:
    More About Link Analysis --  Topic-specific PageRank, Link Spam.
    More About Locality-Sensitive Hashing

  • +

    Recommended Background

    A course in database systems programming (e.g., SQL) is recommended, as is a basic course on algorithms and data structures.

  • +

    Course Format

    There will be about 2 hours of video to watch each week, broken into small segments.  There will be automated homeworks to do for each week, and a final exam.

  • +

    Suggested Reading

    There is a free book "Mining of Massive Datasets, by Leskovec, Rajaraman, and Ullman (who by coincidence are the instructors for this course :-).  You can download it at  Hardcopies can be purchased from Cambridge Univ. Press.

Course Fee:

Course Type:


Course Status:



1 - 4 hours / week

Attended this course?

Back to Top

Awards & Accolades for MyTechLogy
Winner of
Top 100 Asia
Finalist at SiTF Awards 2014 under the category Best Social & Community Product
Finalist at HR Vendor of the Year 2015 Awards under the category Best Learning Management System
Finalist at HR Vendor of the Year 2015 Awards under the category Best Talent Management Software
Hidden Image Url

Back to Top