Regret: measuring the quality of exploration

video-placeholder
Loading...
Visualizar o programa do curso

Avaliações

4.2 (431 classificações)
  • 5 stars
    58,46%
  • 4 stars
    22,96%
  • 3 stars
    9,04%
  • 2 stars
    4,17%
  • 1 star
    5,33%
SF
8 de Abr de 2020

At times it felt like a bit more video material would be helpful to better understand the subject/gain deeper understanding.\n\nAnd fixing some of the notebooks would be helpful.

FZ
13 de Fev de 2019

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which makes you feel confusing.

Na lição
Exploration
In this final week you'll learn how to build better exploration strategies with a focus on contextual bandit setup. In honor track, you'll also learn how to apply reinforcement learning to train structured deep learning models.

Ministrado por

  • Placeholder

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Placeholder

    Alexander Panin

    Lecturer

Explore nosso catálogo

Registre-se gratuitamente e obtenha recomendações, atualizações e ofertas personalizadas.