Section 1: Basic to Advance SQL
- While testing big data application, we have to play with huge amount of data every data
- Which could be raw data or processed data.
- To work with data in bigdata testing we will have to write and execute lot of queries in
- SQL is a pre-requisite to learn HiveQL
Section 2: Python
Python is a powerful, flexible, open-source language that is easy to learn, easy to use, and has powerful libraries for data manipulation and analysis. Python is a popular, general-purpose programming language with an emphasis on being readable and allowing programmers to use fewer lines of code to accomplish tasks than in older languages. Libraries such as NumPy, SciPy, and Matplotlib make it useful for scientific computing.
Python is an excellent tool for data analysis for four reasons:
- Open source
Section 3 Big Data demands better shell skills
Complete Big data architecture would be setup on Unix machine, If we have prior understand of Unix and Shell script, it will be easy for us to work smoothly
This course if very useful for professional who are looking opportunities in Manual and Automation Testing.