Regret: measuring the quality of exploration

Reviews

4.2 (383 ratings)
  • 5 stars: 56.39%
  • 4 stars: 23.75%
  • 3 stars: 9.13%
  • 2 stars: 4.69%
  • 1 star: 6%

FZ

Feb 14, 2019

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which makes you feel confusing.

LJ

Oct 07, 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

From the lesson
Exploration
In this final week you'll learn how to build better exploration strategies, with a focus on the contextual bandit setup. In the honors track, you'll also learn how to apply reinforcement learning to train structured deep learning models.

Taught by

  • Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab

  • Alexander Panin
    Lecturer
