0:00
So one issue with both the SVD and principal components analysis is missing values.
Real data will typically have missing values, and the problem is that if you try to run the SVD
on a data set that has some missing values, like the one you see that I've created here,
you get an error.
You just can't run it on a data set that has missing values.
So you need to do something about the missing values
before you run an SVD or a PCA.
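As a sketch of that failure in R, assuming a small random matrix (the names dataMatrix and dataMatrix2 are just illustrative placeholders):

set.seed(12345)
dataMatrix <- matrix(rnorm(400), nrow = 40)    # a complete 40 x 10 data matrix
dataMatrix2 <- dataMatrix
dataMatrix2[sample(1:400, size = 40)] <- NA    # knock out 40 values at random
svd(dataMatrix2)
## Error: infinite or missing values in 'x'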
0:26
So, one possibility, and there are many others, is to
use the impute package, which is available from the Bioconductor project,
and just impute the missing data points so that
you have a value everywhere, and then you can run your SVD.
The code here uses the impute.knn function, which takes the
missing values in a row and imputes them using the k nearest neighbors to
that row.
So if k, for example, is five, then it will
take the five rows that are closest to the row with
the missing data, and then fill in the missing values with
an average of those five rows.
And so, once we've imputed the data with
this impute.knn function, we can run the SVD.
You can see, it runs without error.
And then we can plot the first singular
vector from each of the two decompositions.
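A minimal sketch of this step, continuing with the hypothetical dataMatrix and dataMatrix2 objects from above:

library(impute)                                     # available from Bioconductor
dataMatrix2 <- impute.knn(dataMatrix2, k = 5)$data  # fill NAs from 5 nearest rows
svd1 <- svd(dataMatrix)     # SVD of the original, complete matrix
svd2 <- svd(dataMatrix2)    # SVD of the imputed matrix -- no error now
par(mfrow = c(1, 2))
plot(svd1$v[, 1], pch = 19, main = "Original")      # first right singular vector
plot(svd2$v[, 1], pch = 19, main = "Imputed")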
So on the left-hand side, I've got the
first singular vector from the original data matrix,
and on the right-hand side,
I've got the first singular vector from the
data matrix where the missing data was imputed.
Now, you can see that they're roughly similar.
They're not exactly the same, but the imputation
didn't seem to have a major effect on
the running of the SVD.
1:46
So this final example here is just kind of an interesting one.
I want to show how you can take an
actual image, which is represented as a matrix, and
develop a lower dimensional, or lower rank,
representation of that image.
So here's a picture of a face.
It's a relatively low resolution picture of a face, but you can see that there
is a nose, two ears, two eyes, and a mouth there.
And so what we're going to do is run the SVD
on this face data and look at the variance explained.
You can see that the first
singular vector explains about 40% of the variation,
the second about 20-some percent,
and the third maybe 15%.
And if you look at, say, the first five to ten singular
vectors, they capture pretty much all of the variation in the data set.
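As a sketch of this step, assuming the image is stored in a numeric matrix called faceData (a hypothetical name for illustration):

svd1 <- svd(faceData)   # SVD of the image matrix
plot(svd1$d^2 / sum(svd1$d^2), pch = 19,
     xlab = "Singular vector", ylab = "Proportion of variance explained")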
And so we can actually
look at the image that's generated by, say, just the first
singular vector, or the first five, or the first ten.
2:53
That is, an image that uses fewer components than the original data set.
So here I'm creating one approximation that uses just
the first principal component, the first singular vector.
Another one takes the first five altogether.
And then another one takes the first ten.
And so we can take a look at what these approximations look like.
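A minimal sketch of building those rank-1, rank-5, and rank-10 approximations from the svd1 object above (%*% is R's matrix multiplication):

approx1  <- svd1$u[, 1] %*% t(svd1$v[, 1]) * svd1$d[1]                   # rank 1
approx5  <- svd1$u[, 1:5]  %*% diag(svd1$d[1:5])  %*% t(svd1$v[, 1:5])   # rank 5
approx10 <- svd1$u[, 1:10] %*% diag(svd1$d[1:10]) %*% t(svd1$v[, 1:10])  # rank 10
par(mfrow = c(1, 4))
image(t(approx1)[, nrow(approx1):1], main = "(a)")
image(t(approx5)[, nrow(approx5):1], main = "(b)")
image(t(approx10)[, nrow(approx10):1], main = "(c)")
image(t(faceData)[, nrow(faceData):1], main = "(d) original")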
So the first image here, all the way
on the left, uses just a single singular vector.
And you can see that it's not a very good picture, so to speak.
There's not really a face there.
There's not much you can see.
But it's asking a lot to represent an entire image using just a single vector.
So, if we move on to the second one from the left, you
can see that basically most of the key features are already there.
This uses the first five singular vectors.
And you can see that clearly there's a face:
two eyes, a nose, a mouth, and two ears.
3:49
If you move on to the next picture, which is
letter (c) here, you can see that it
has a little bit more definition.
This one uses the first ten singular vectors.
But it's not very different from the second one, which only used five.
And then the very last one here on the right is the original data set.
So, you can see that if you use
just a few singular vectors, maybe up to five or ten,
you can get a reasonable approximation of this face without
having to store all of the original data.
So, this is an example
of the kind of data compression
that the singular value decomposition can provide.
Now, data compression and statistical summaries
are kind of two sides of the same coin.
And so if you want to summarize a data set
with a smaller number of features, the
singular value decomposition is also useful for that.
4:44
So, just a couple of notes and further resources
for the singular value decomposition and principal components analysis.
One of the issues is that the scale of your data matters.
For example, it's common to
measure lots of different variables that come on different scales.
And that can cause a problem, because if one
variable is much larger than another variable, just because
its units are so different, that variable will tend to drive
the principal components and the singular vectors.
And that may not be particularly meaningful to you.
So you want to check that the
scales of the different columns or rows are roughly comparable to each other.
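A common fix, sketched here for the hypothetical dataMatrix from earlier, is to center and scale each column before decomposing:

# scale() centers each column to mean 0 and scales it to standard deviation 1,
# so no single variable dominates just because of its units
svd1 <- svd(scale(dataMatrix))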
5:21
And, as we saw in the example with the
two different patterns, the principal components and
the singular vectors may mix real patterns together.
And so the patterns that you see may not
represent separable underlying patterns; they may be
patterns that are mixed together.
The singular value decomposition can be computationally intensive if you have a
very large matrix, so that's something to keep in mind.
We used relatively small matrices here, but
of course computing power is getting ever more
5:48
powerful, and there are some highly optimized and
specialized matrix libraries out there
for computing the singular value decomposition.
And so this can be done on lots of practical problems.
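One such specialized option in R, named here as an assumption since the lecture doesn't single any library out, is the irlba package, which computes just the first few singular vectors of a large matrix rather than the full decomposition:

library(irlba)                        # fast truncated SVD for large matrices
bigMatrix <- matrix(rnorm(1e6), nrow = 1000)
svdTop5 <- irlba(bigMatrix, nv = 5)   # only the first 5 singular vectors/values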
6:04
And so here are a couple of links to further resources
on how to use principal
components analysis and the singular value decomposition.
There are also other kinds
of approaches that are similar to these,
but are different in many of the details.
You may hear about these approaches:
things like factor analysis, independent
components analysis, and latent semantic analysis.
These are worth exploring, and they're related to the basic
idea behind principal
components analysis and the singular value decomposition,
which is that you want to find a
lower dimensional representation that explains most of the variation
in the data that you see.