Using Python to Control and Document Your Data Science Processes
▪ Data Science with Python Training: Data types and objects
▪ Loading packages, namespaces ▪ Reading and writing data ▪ Simple plotting ▪ Control flow ▪ Debugging ▪ Code profiling
Copyright @ 2015 Learntek. All Rights Reserved. 3
Accessing and Preparing Data
▪ Acquiring Data with Python
▪ Loading from CSV files ▪ Accessing SQL databases ▪ Cleansing Data with Python ▪ Stripping out extraneous information ▪ Normalizing data ▪ Formatting data
Copyright @ 2015 Learntek. All Rights Reserved. 4
Numerical Analysis, Data Exploration, and Data Visualization with NumPy Arrays & Matplotlib
▪ The NumPy array
▪ 2D plotting with Matplotlib ▪ N-dimensional array operations and manipulations ▪ Memory mapped files
Copyright @ 2015 Learntek. All Rights Reserved. 5
Exploring Data with Pandas and scipy.stats
▪ Data manipulation with Pandas
▪ Statistical analysis with Pandas ▪ Time series analysis with Pandas ▪ Overview of statistical tools in scipy.stats
Copyright @ 2015 Learntek. All Rights Reserved. 6
Machine Learning with scikit-learn
▪ Input: 2D, samples, and features
▪ Pre-processing data ▪ Regression ▪ Classification ▪ Model selection