MyPage is a personalized page based on your interests.The page is customized to help you to find content that matters you the most.


I'm not curious

mastering data integration (ETL) with pentaho kettle PDI

Course Summary

hands on , real case studies ,tips, examples , walk trough a full project from start to end based on mySQL sakila DB.


  • +

    Course Syllabus

    • Introduction
      • course promo intro
      • the ETL course intro
    • ETL concept and environment
      • what is ETL
      • When do we use ETL (data integration)
      • the data warehouse concept
      • Analytical structure
      • ETL tools comparison
      • data sources part 1
      • data sources part 2
    • Installations
      • What we are going to install?
      • Install mysql
      • Install JRE - java runtime
      • Install navicat - mysql manager
      • Install sakila database (and notepad++)
      • Install pentaho data integration (kettle)
      • install power architect
      • Install expresso
    • Software Walkthroughs
      • power architect walkthrough
      • Navicat walkthrough
    • Hands on - Pentaho
      • Pentaho PDI getting started
      • kettle variables part 1
      • kettle variables part 2
      • kettle database connection
      • Pentaho repositories
      • schema introduction
    • The Date Dimension
      • dim date intro
      • generate rows part 1
      • generate rows part 2
      • generate rows part 3
      • the add sequence
      • the select values
      • the mapping / string cut / string concat
      • the table output
      • the string operation
      • dim date summary
    • dim time
      • dim time intro
      • arrange steps and create hours and minutes
      • the Cartesian step
      • Cartesian customer example
      • the modified java script value
      • the field set / filter rows / dummy steps
      • dim time summary
    • dim staff
      • dim staff intro
      • the table input
      • the data grid / value mapper
      • consideration 1 - historical data in dimensions
      • consideration 2 - truncate or update table
      • consideration 3 - be like mike - deleted rows on dimension
    • dim store
      • dim store intro
      • the database lookup
      • the stream lookup
      • the insert /update step
      • the system info
    • dim customer
      • dim customer intro
      • control "changed data only" input
      • down it goes with the stream
      • slow changing dimension - concept
      • slow changing dimension - example
    • dim film
      • dim film intro
      • objectives
      • the number range
      • the merge join / sort rows / value null
      • the denormaiser / split fields to rows
    • fact rentals
      • fact rental intro
      • the inventory - film and store id
      • slow changing dimension on fact table
      • counter and date diff calculation
      • key date handling
      • the time dimension check
      • error handling step
    • Go to production
      • production steps intro
      • the final job
      • kitchen batch file
      • schedule jobs
      • validation - secure the stream part 1
      • validation - secure the stream part 2
      • logging
      • performance
    • Whats next...
      • need more input
      • this is a goodbye


Course Fee:
USD 127

Course Type:

Self-Study

Course Status:

Active

Workload:

1 - 4 hours / week

Attended this course?

Back to Top

Awards & Accolades for MyTechLogy
Winner of
REDHERRING
Top 100 Asia
Finalist at SiTF Awards 2014 under the category Best Social & Community Product
Finalist at HR Vendor of the Year 2015 Awards under the category Best Learning Management System
Finalist at HR Vendor of the Year 2015 Awards under the category Best Talent Management Software
Hidden Image Url

Back to Top