Advanced Techniques for Exploring Data Sets with Pandas
Explore popular datasets in R, while mastering advanced techniques used for them
In this course, you will learn how to start using pandas from end-to-end: from getting your data into pandas; using pandas to manipulate, transform, analyze, and visualize data; to ultimately taking your transformed data out of pandas into any number of formats.
This course will get you (or anyone who has never used pandas) started on using it as a complete end-to-end data analysis workflow. You will start by setting up Python, pandas, and Jupyter notebooks. You will learn how to use Jupyter notebooks to run Python code. We will then show how to get data into pandas and do some exploratory analysis. You will learn how to manipulate and reshape data using pandas methods. You will also learn how to deal with missing data from your datasets, how to draw charts and plots using pandas and Matplotlib, and how to create some cool visualizations for your audience. Finally, you will wrap-up your newly gained pandas knowledge by learning how to get data out of pandas into some popular file formats.
About the Author
Harish Garg is a Data Analyst, author, and Software Developer who is really passionate about Data Science and the Python programming language. He is a graduate from Udacity's Data Analyst Nanodegree program. He has 17 years of industry experience, which includes data analysis using Python, developing and testing enterprise and consumer software, managing projects and software teams, and creating training material and tutorials. Harish also worked for 11 years for Intel Security (previously McAfee, Inc.).
He regularly contributes articles and tutorials on data analysis and Python. He is also active in the open data community and is a contributing member of the Data4Democracy open data initiative. He has written data analysis pieces for think tan takshashila.