Advantage actor-critic


Reviews

4.1 (232 ratings)
  • 5 stars: 121 ratings
  • 4 stars: 58 ratings
  • 3 stars: 24 ratings
  • 2 stars: 11 ratings
  • 1 star: 18 ratings

VO

Mar 17, 2019

Well prepared and taught course. Will highly recommend as a primer for reinforcement learning.

AH

Aug 17, 2018

Learned a lot. The pace is quick and the assignments are sometimes challenging.

From the lesson
Policy-based methods
We spent the previous 3 modules working on value-based methods: learning state values, action values and so on. Now it's time to see an alternative approach that doesn't require you to predict all future rewards in order to learn something.

Taught by

  • Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin
    Lecturer
