Mathematics for Machine Learning: PCA

This course is part of the Mathematics for Machine Learning Specialization

Taught in English

Instructor: Marc Peter Deisenroth

87,705 already enrolled

Course

Gain insight into a topic and learn the fundamentals

4.0

(3,050 reviews)

80%

Intermediate level

Some related experience required

20 hours (approximately)

Flexible schedule

Learn at your own pace

What you'll learn

Implement mathematical concepts using real-world data
Derive PCA from a projection perspective
Understand how orthogonal projections work
Master PCA

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

11 quizzes

There are 4 modules in this course

This intermediate-level course introduces the mathematical foundations needed to derive Principal Component Analysis (PCA), a fundamental dimensionality reduction technique. We'll cover some basic statistics of data sets, such as mean values and variances; compute distances and angles between vectors using inner products; and derive orthogonal projections of data onto lower-dimensional subspaces. Using all these tools, we'll then derive PCA as a method that minimizes the average squared reconstruction error between data points and their reconstructions.
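
The objective mentioned above can be sketched in symbols. The notation is an assumption made here for illustration (the page itself fixes no symbols): for N data points x_n and their reconstructions \tilde{x}_n from a lower-dimensional subspace, the average squared reconstruction error could be written as

```latex
J = \frac{1}{N} \sum_{n=1}^{N} \left\lVert x_n - \tilde{x}_n \right\rVert^2
```

PCA then looks for the subspace that makes J as small as possible.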

At the end of this course, you'll be familiar with important mathematical concepts and be able to implement PCA yourself. If you're struggling, you'll find a set of Jupyter notebooks that let you explore properties of the techniques and walk you through what you need to do to get back on track. If you are already an expert, this course may refresh some of your knowledge.

The lectures, examples and exercises require:

1. Some ability to think abstractly
2. A good background in linear algebra (e.g., matrix and vector algebra, linear independence, basis)
3. A basic background in multivariate calculus (e.g., partial derivatives, basic optimization)
4. Basic knowledge of Python programming and NumPy

Disclaimer: This course is substantially more abstract and requires more programming than the other two courses of the specialization. However, this type of abstract thinking, algebraic manipulation and programming is necessary if you want to understand and develop machine learning algorithms.

Statistics of Datasets

Principal Component Analysis (PCA) is one of the most important dimensionality reduction algorithms in machine learning. In this course, we lay the mathematical foundations to derive and understand PCA from a geometric point of view. In this module, we learn how to summarize datasets (e.g., images) using basic statistics, such as the mean and the variance. We also look at how the mean and the variance change when we shift or scale the original data set. We will provide mathematical intuition as well as the skills to derive the results. We will also implement our results in code (Jupyter notebooks), which will let us put our mathematical understanding into practice by computing averages of image data sets. Therefore, some Python/NumPy background will be necessary to get through this course. Note: If you have taken the other two courses of this specialization, this one will be harder (mostly because of the programming assignments). However, if you make it through the first week of this course, you will make it through the full course with high probability.
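
To make the shift-and-scale behaviour concrete, here is a minimal NumPy sketch. It is not taken from the course notebooks: the synthetic data, the matrix `A` and the offset `b` are assumptions chosen for illustration. It checks that transforming every data point as `A x + b` moves the mean to `A mean + b` and turns the covariance matrix `S` into `A S A^T`, so the shift `b` has no effect on the covariance.

```python
import numpy as np

# Synthetic data set (an assumption for illustration): 100 points, 3 features.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))

mean_X = X.mean(axis=0)
cov_X = np.cov(X, rowvar=False)          # rows are data points

# Hypothetical linear transformation A and shift b applied to every point.
A = np.array([[2.0, 0.0, 0.0],
              [0.0, 0.5, 0.0],
              [1.0, 0.0, 1.0]])
b = np.array([1.0, -2.0, 3.0])
Y = X @ A.T + b                          # y_n = A x_n + b for each row x_n

mean_Y = Y.mean(axis=0)
cov_Y = np.cov(Y, rowvar=False)

# The mean is transformed like a data point; the covariance ignores the shift.
print(np.allclose(mean_Y, A @ mean_X + b))
print(np.allclose(cov_Y, A @ cov_X @ A.T))
```

Pure scaling is the special case where `A` is a multiple of the identity: scaling the data by a factor `a` scales the variance by `a**2`.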

What's included

8 videos, 6 readings, 3 quizzes, 1 programming assignment, 1 discussion prompt, 2 ungraded labs, 1 plugin

8 videos (total 27 minutes)

Introduction to the course (3 minutes)
Welcome to module 1 (0 minutes)
Mean of a dataset (4 minutes)
Variance of one-dimensional datasets (4 minutes)
Variance of higher-dimensional datasets (5 minutes)
Effect on the mean (4 minutes)
Effect on the (co)variance (3 minutes)
See you next module! (0 minutes)

6 readings (total 40 minutes)

About Imperial College & the team (5 minutes)
How to be successful in this course (5 minutes)
Grading policy (5 minutes)
Additional readings & helpful references (5 minutes)
Set up Jupyter notebook environment offline (10 minutes)
Symmetric, positive definite matrices (10 minutes)

3 quizzes (total 50 minutes)

Variance of 1D datasets (15 minutes)
Mean of datasets (15 minutes)
Covariance matrix of a two-dimensional dataset (20 minutes)

1 programming assignment (total 30 minutes)

Mean/covariance of a dataset + effect of a linear transformation (30 minutes)

1 discussion prompt (total 15 minutes)

Nice to meet you! (15 minutes)

2 ungraded labs (total 120 minutes)

NumPy Tutorial (60 minutes)
Mean/covariance of a dataset + effect of a linear transformation (60 minutes)

1 plugin (total 15 minutes)

Pre-course Survey (15 minutes)

Inner Products

Data can be interpreted as vectors. Vectors allow us to talk about geometric concepts, such as lengths, distances and angles, to characterize similarity between vectors. This will become important later in the course when we discuss PCA. In this module, we will introduce and practice the concept of an inner product. Inner products allow us to talk about geometric concepts in vector spaces. More specifically, we will start with the dot product (which we may still remember from school) as a special case of an inner product, and then move toward a more general concept of an inner product, which plays an integral part in some areas of machine learning, such as kernel machines (this includes support vector machines and Gaussian processes). We have a lot of exercises in this module to practice and understand the concept of inner products.
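
As a rough illustration of how lengths, distances and angles all follow from an inner product, the sketch below defines a general inner product `<x, y> = x^T A y` for a symmetric, positive definite matrix `A`. The function names and the example matrix are assumptions made here, not the course's own code. With `A` left out it reduces to the ordinary dot product.

```python
import numpy as np

def inner(x, y, A=None):
    """Inner product <x, y> = x^T A y; A=None gives the dot product."""
    if A is None:
        return float(x @ y)
    return float(x @ A @ y)

def length(x, A=None):
    """Length (norm) induced by the inner product."""
    return float(np.sqrt(inner(x, x, A)))

def distance(x, y, A=None):
    """Distance between x and y: the length of their difference."""
    return length(x - y, A)

def angle(x, y, A=None):
    """Angle in radians between x and y under the inner product."""
    cos = inner(x, y, A) / (length(x, A) * length(y, A))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))

x = np.array([1.0, 1.0])
y = np.array([-1.0, 1.0])

# Example symmetric, positive definite matrix (an assumption for illustration).
A = np.array([[2.0, 0.0],
              [0.0, 1.0]])

print(angle(x, y))       # angle under the dot product
print(angle(x, y, A))    # angle under the inner product defined by A
```

Under the dot product these two vectors are orthogonal, but under the inner product defined by `A` they are not: orthogonality depends on which inner product is used.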

What's included

8 videos, 1 reading, 4 quizzes, 1 programming assignment, 2 ungraded labs

8 videos (total 36 minutes)

Welcome to module 2 (1 minute)
Dot product (4 minutes)
Inner product: definition (5 minutes)
Inner product: length of vectors (7 minutes)
Inner product: distances between vectors (3 minutes)
Inner product: angles and orthogonality (5 minutes)
Inner products of functions and random variables (optional) (7 minutes)
Heading for the next module! (0 minutes)

1 reading (total 20 minutes)

Basis vectors (20 minutes)

4 quizzes (total 110 minutes)

Properties of inner products (20 minutes)
Angles between vectors using a non-standard inner product (30 minutes)
Dot product (30 minutes)
General inner products: lengths and distances (30 minutes)

1 programming assignment (total 60 minutes)

Inner products and angles (60 minutes)

2 ungraded labs (total 120 minutes)

Inner products and angles (60 minutes)
Optional: K-nearest Neighbors Algorithm (60 minutes)

Orthogonal Projections

In this module, we will look at orthogonal projections of vectors, which live in a high-dimensional vector space, onto lower-dimensional subspaces. This will play an important role in the next module when we derive PCA. We will start off with a geometric motivation of what an orthogonal projection is and work our way through the corresponding derivation. We will end up with a single equation that allows us to project any vector onto a lower-dimensional subspace, and we will also understand how this equation came about. As in the other modules, we will have both pen-and-paper practice and a small programming example with a Jupyter notebook.
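
One common form of such a single equation, assuming the standard dot product and a matrix `B` whose linearly independent columns span the subspace, is the projection matrix `P = B (B^T B)^{-1} B^T`. The sketch below is an illustration under those assumptions, not the course's reference solution; the basis and the vector are made up for the example.

```python
import numpy as np

def projection_matrix(B):
    """Projection matrix onto the column space of B (dot-product geometry).

    Assumes the columns of B are linearly independent. Solving the linear
    system (B^T B) M = B^T avoids forming an explicit matrix inverse.
    """
    return B @ np.linalg.solve(B.T @ B, B.T)

# Hypothetical basis of a 2D subspace of R^3, and a vector to project.
B = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
x = np.array([1.0, 2.0, 0.0])

P = projection_matrix(B)
x_proj = P @ x            # orthogonal projection of x onto the subspace
residual = x - x_proj     # the part of x that the subspace cannot represent

# The residual is orthogonal to every basis vector of the subspace.
print(np.allclose(B.T @ residual, 0.0))
```

Projecting twice changes nothing (`P @ P` equals `P`), which is a quick sanity check for any projection matrix.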

What's included

6 videos, 1 reading, 2 quizzes, 1 programming assignment, 1 ungraded lab

6 videos (total 24 minutes)

Welcome to module 3 (0 minutes)
Projection onto 1D subspaces (7 minutes)
Example: projection onto 1D subspaces (3 minutes)
Projections onto higher-dimensional subspaces (8 minutes)
Example: projection onto a 2D subspace (3 minutes)
This was module 3! (0 minutes)

1 reading (total 20 minutes)

Full derivation of the projection (20 minutes)

2 quizzes (total 85 minutes)

Projection onto a 1-dimensional subspace (25 minutes)
Project 3D data onto a 2D subspace (60 minutes)

1 programming assignment (total 30 minutes)

Orthogonal projections (30 minutes)

1 ungraded lab (total 60 minutes)

Orthogonal projections (60 minutes)

Principal Component Analysis

We can think of dimensionality reduction as a way of compressing data with some loss, similar to JPEG or MP3. Principal Component Analysis (PCA) is one of the most fundamental dimensionality reduction techniques used in machine learning. In this module, we use the results from the first three modules of this course and derive PCA from a geometric point of view. This module is the most challenging one in the course: we will go through an explicit derivation of PCA plus some coding exercises that will make us proficient users of PCA.
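
The sketch below shows one common way to carry out PCA with NumPy: center the data, compute the covariance matrix, keep the eigenvectors with the largest eigenvalues, and project onto them. It is a minimal illustration and not the course's assignment solution; the function name `pca`, the synthetic data and the 1/N normalization of the covariance are all assumptions.

```python
import numpy as np

def pca(X, num_components):
    """Minimal PCA sketch via eigendecomposition of the data covariance.

    X has one data point per row. Returns the reconstruction of X from the
    principal subspace, the basis B of that subspace, and the eigenvalues
    that belong to the kept directions.
    """
    mean = X.mean(axis=0)
    Xc = X - mean                              # center the data
    S = Xc.T @ Xc / X.shape[0]                 # covariance matrix (1/N)
    eigvals, eigvecs = np.linalg.eigh(S)       # eigh: ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:num_components]
    B = eigvecs[:, order]                      # orthonormal basis vectors
    codes = Xc @ B                             # low-dimensional coordinates
    X_rec = codes @ B.T + mean                 # lossy reconstruction
    return X_rec, B, eigvals[order]

# Synthetic correlated data (an assumption for illustration).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5)) @ rng.normal(size=(5, 5))

X_rec, B, top_eigvals = pca(X, num_components=2)
avg_sq_error = np.mean(np.sum((X - X_rec) ** 2, axis=1))
print(avg_sq_error)
```

With this normalization, the average squared reconstruction error equals the sum of the eigenvalues of the discarded directions, which is the sense in which the compression is lossy.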

What's included

10 videos, 5 readings, 2 quizzes, 1 programming assignment, 2 ungraded labs, 1 plugin

10 videos (total 51 minutes)

Welcome to module 4 (1 minute)
Problem setting and PCA objective (7 minutes)
Finding the coordinates of the projected data (5 minutes)
Reformulation of the objective (10 minutes)
Finding the basis vectors that span the principal subspace (7 minutes)
Steps of PCA (4 minutes)
PCA in high dimensions (5 minutes)
Other interpretations of PCA (optional) (7 minutes)
Summary of this module (0 minutes)
This was the course on PCA (0 minutes)

5 readings (total 90 minutes)

Vector spaces (20 minutes)
Orthogonal complements (20 minutes)
Multivariate chain rule (20 minutes)
Lagrange multipliers (20 minutes)
Did you like the course? Let us know! (10 minutes)

2 quizzes (total 70 minutes)

Chain rule practice (40 minutes)
Steps of PCA (30 minutes)

1 programming assignment (total 30 minutes)

PCA (30 minutes)

2 ungraded labs (total 120 minutes)

Principal Components Analysis (PCA) (60 minutes)
Optional: Demonstrations of PCA (60 minutes)

1 plugin (total 15 minutes)

Post-Course Survey (15 minutes)

Instructor

Instructor ratings

3.9 (407 ratings)

Marc Peter Deisenroth

Imperial College London

1 course, 87,705 learners

Offered by

Imperial College London

Learner reviews

4.0 (3,050 reviews)

5 stars: 51.22%
4 stars: 22.37%
3 stars: 12.65%
2 stars: 6.67%
1 star: 7.06%

Frequently asked questions

You will need good Python knowledge to get through the course.

This course is significantly harder and different in style: it uses more abstract concepts and requires much more programming experience than the other two courses. Therefore, when you complete the full specialization, you will be equipped with a much more diverse set of skills.

Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:

The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

There are 4 modules in this course:

Statistics of Datasets
Inner Products
Orthogonal Projections
Principal Component Analysis
