Statistical Learning with Python

(5 customer reviews)

64,584.99

Category:

Description

This is an introductory-level course in supervised learning focusing on regression and classification methods. The syllabus includes linear and polynomial regression, logistic regression, and linear discriminant analysis; cross-validation and the bootstrap; model selection and regularization methods (ridge and lasso); nonlinear models, splines, and generalized additive models; tree-based methods, random forests, and boosting; support-vector machines; neural networks and deep learning; survival models; and multiple testing. Some unsupervised learning methods are discussed: principal components and clustering (k-means and hierarchical).

This is not a math-heavy class, so we try to describe the methods without heavy reliance on formulas and complex mathematics. We focus on what we consider to be the essential elements of modern data science. Computing in this course is done in Python. There are lectures devoted to Python, giving tutorials from the ground up and progressing with more detailed sessions that implement the techniques in each chapter. We also offer a separate and original version of this course called Statistical Learning with R – the chapter lectures are the same, but the lab lectures and computing are done using R.