Chevron Left
Voltar para Scalable Machine Learning on Big Data using Apache Spark

Comentários e feedback de alunos de Scalable Machine Learning on Big Data using Apache Spark da instituição IBM

104 classificações
15 avaliações

Sobre o curso

This course will empower you with the skills to scale data science and machine learning (ML) tasks on Big Data sets using Apache Spark. Most real world machine learning work involves very large data sets that go beyond the CPU, memory and storage limitations of a single computer. Apache Spark is an open source framework that leverages cluster computing and distributed storage to process extremely large data sets in an efficient and cost effective manner. Therefore an applied knowledge of working with Apache Spark is a great asset and potential differentiator for a Machine Learning engineer. After completing this course, you will be able to: - gain a practical understanding of Apache Spark, and apply it to solve machine learning problems involving both small and big data - understand how parallel code is written, capable of running on thousands of CPUs. - make use of large scale compute clusters to apply machine learning algorithms on Petabytes of data using Apache SparkML Pipelines. - eliminate out-of-memory errors generated by traditional machine learning frameworks when data doesn’t fit in a computer's main memory - test thousands of different ML models in parallel to find the best performing one – a technique used by many successful Kagglers - (Optional) run SQL statements on very large data sets using Apache SparkSQL and the Apache Spark DataFrame API. Enrol now to learn the machine learning techniques for working with Big Data that have been successfully applied by companies like Alibaba, Apple, Amazon, Baidu, eBay, IBM, NASA, Samsung, SAP, TripAdvisor, Yahoo!, Zalando and many others. NOTE: You will practice running machine learning tasks hands-on on an Apache Spark cluster provided by IBM at no charge during the course which you can continue to use afterwards. Prerequisites: - basic python programming - basic machine learning (optional introduction videos are provided in this course as well) - basic SQL skills for optional content The following courses are recommended before taking this class (unless you already have the skills) or similar or similar for optional lectures...

Melhores avaliações


Sep 24, 2019

In very simple and crisp way a lot of details are covered about Apache Spark. Very good way to start.


Sep 30, 2019

Great tutor, he loves to keep things simple and to the point. Loved the course.

Filtrar por:

1 — 15 de {totalReviews} Avaliações para o Scalable Machine Learning on Big Data using Apache Spark

por Abdelrahman g e f

Sep 16, 2019

the accent of the instructor was very hard to understand him during explanation but he was good instructor at all

por Amit T

Sep 24, 2019

In very simple and crisp way a lot of details are covered about Apache Spark. Very good way to start.

por Waqas K O

Sep 30, 2019

Great tutor, he loves to keep things simple and to the point. Loved the course.

por Philippe D

Oct 23, 2019

The course is interesting but I think a few things could be improved.

For instance, some code examples from the videos are outdated because of a newer spark version. The video was edited to mention that the github repo was updated but I was unable to find the updated code.

One (maybe more?) of the videos was done in a car; It makes the whole thing feel unprofessional even though the teacher's skills far exceed the requirements for teaching this course.

As others have mentioned, the teacher's accent can be a bit difficult to understand at times but to me, this does not affect the quality of the course. The teacher always seems interested and is smiling most of the time which might seem unimportant but it still sets a positive mood for the lectures which is great.

All in all, the course is interesting and it provides a good introduction to Machine Learning using Apache Spark.

por Lewis m

Nov 12, 2019

So far the questions and quizes seem unrelated to machine learning. The videos are poorly set out, with breif explanations and the whole thing seems rushed.

por Jay P

Sep 29, 2019


por Ruslan I M V

Nov 09, 2019

Apache spark is great and powerful but the lectures are not clear and long.

por Ujjwal G

Nov 11, 2019

For a intorductory course it is very good. Do not expect anything too advanced.

por Yuting K

Oct 03, 2019

The quality of the videos could be better

por Gherbi H

Nov 15, 2019

A very good course and will recommend it for anyone who has Apache Spark experience and wants to get an introduction to ML lib and machine learning in Apache Spark, the assignment submissions need some work but other than that a very good introductive course.

por Yasser E H

Oct 27, 2019

Really interesting content

Unclear coding explanations

Limitations with the free access in IBM Watson Studio

por Suresh C

Nov 02, 2019

There should be more details about Apache spark and some examples

por Jair J C C

Oct 19, 2019

Very Good, but I think the course needs more challenging exams

por Farrukh N A

Nov 06, 2019

Course can be improved by focusing more on ML algorithms.... Explanation of GBT and Random Forest was not provided. But they were used.

por Benhur O J

Oct 08, 2019

Too superficial. The python example codes are very cryptic and not very well commented. The programming videos are very difficult to follow because the instructor is literally reading the code instead of explaining it.