Are you interested in predicting future outcomes using your data? This course helps you do just that! Machine learning is the process of developing, testing, and applying predictive algorithms to achieve this goal. Make sure to familiarize yourself with course 3 of this specialization before diving into these machine learning concepts. Building on Course 3, which introduces students to integral supervised machine learning concepts, this course will provide an overview of many additional concepts, techniques, and algorithms in machine learning, from basic classification to decision trees and clustering. By completing this course, you will learn how to apply, test, and interpret machine learning algorithms as alternative methods for addressing your research questions.
Este curso faz parte do Programa de cursos integrados Análise e interpretação de dadosAnálise e Interpretação de Dados
oferecido por
Informações sobre o curso
Resultados de carreira do aprendiz
29%
36%
17%
Habilidades que você terá
Resultados de carreira do aprendiz
29%
36%
17%
oferecido por

Universidade Wesleyan
Wesleyan University, founded in 1831, is a diverse, energetic liberal arts community where critical thinking and practical idealism go hand in hand. With our distinctive scholar-teacher culture, creative programming, and commitment to interdisciplinary learning, Wesleyan challenges students to explore new ideas and change the world. Our graduates go on to lead and innovate in a wide variety of industries, including government, business, entertainment, and science.
Programa - O que você aprenderá com este curso
Decision Trees
In this session, you will learn about decision trees, a type of data mining algorithm that can select from among a large number of variables those and their interactions that are most important in predicting the target or response variable to be explained. Decision trees create segmentations or subgroups in the data, by applying a series of simple rules or criteria over and over again, which choose variable constellations that best predict the target variable.
Random Forests
In this session, you will learn about random forests, a type of data mining algorithm that can select from among a large number of variables those that are most important in determining the target or response variable to be explained. Unlike decision trees, the results of random forests generalize well to new data.
Lasso Regression
Lasso regression analysis is a shrinkage and variable selection method for linear regression models. The goal of lasso regression is to obtain the subset of predictors that minimizes prediction error for a quantitative response variable. The lasso does this by imposing a constraint on the model parameters that causes regression coefficients for some variables to shrink toward zero. Variables with a regression coefficient equal to zero after the shrinkage process are excluded from the model. Variables with non-zero regression coefficients variables are most strongly associated with the response variable. Explanatory variables can be either quantitative, categorical or both. In this session, you will apply and interpret a lasso regression analysis. You will also develop experience using k-fold cross validation to select the best fitting model and obtain a more accurate estimate of your model’s test error rate.
K-Means Cluster Analysis
Cluster analysis is an unsupervised machine learning method that partitions the observations in a data set into a smaller set of clusters where each observation belongs to only one cluster. The goal of cluster analysis is to group, or cluster, observations into subsets based on their similarity of responses on multiple variables. Clustering variables should be primarily quantitative variables, but binary variables may also be included. In this session, we will show you how to use k-means cluster analysis to identify clusters of observations in your data set. You will gain experience in interpreting cluster analysis results by using graphing methods to help you determine the number of clusters to interpret, and examining clustering variable means to evaluate the cluster profiles. Finally, you will get the opportunity to validate your cluster solution by examining differences between clusters on a variable not included in your cluster analysis.
Avaliações
Principais avaliações do APRENDIZADO DE MÁQUINA PARA ANÁLISE DE DADOS
Since it is a part of a specialization, the topics start somewhere in between and is only recommended for those who have completed the previous courses with in these specialization.
I enjoyed this course a lot. It's easy and I've learnt what I need to apply the machine learning techniques. Easy and simple. You don't need to be a mathematician.
More examples in coding and results are expected. So it is more convenient for students to compare different results and understand deeper
More Implementation oriented and less math also contains distracting background videos when explaining important concepts
Sobre Programa de cursos integrados Análise e interpretação de dadosAnálise e Interpretação de Dados
Learn SAS or Python programming, expand your knowledge of analytical methods and applications, and conduct original research to inform complex decisions.

Perguntas Frequentes – FAQ
Quando terei acesso às palestras e às tarefas?
O que recebo ao me inscrever nesta Especialização?
Is financial aid available?
Mais dúvidas? Visite o Central de Ajuda ao Aprendiz.