Негізгі бет MIT 6.S191 (2022): Reinforcement Learning

Күн бұрын

MIT 6.S191 (2022): Reinforcement Learning

Рет қаралды 83,752

Alexander Amini

1 1

Пікірлер: 26

@liangertyguo100
2 жыл бұрын
Best introductory video for DRL. Read lots of books or reviews but none of them explained it so clearly. Thank u so much for the excellent presentation
@josephcheung2311
Жыл бұрын
This course is excellent. The two instructors explain the concepts very well.
@martinsichibeya3405
2 жыл бұрын
my very favorite... honestly i so much love this DL course... thanks for your efforts.
@midimusicforever
2 жыл бұрын
Another step towards the singularity.
@tatendamuzenda8442
Жыл бұрын
loving the course so easy to understand
@alexandertimofeev7626
2 жыл бұрын
Nice lecture! However, it was hard for me to follow the idea of loss function at 44:44. So it works if R_t is negative for low rewards and positive for high rewards, right?
@tommyholladay
2 жыл бұрын
We minimize the loss. By minimizing negative log of the probability multiplied by the reward, we are actually optimizing for the higher reward, which in a sense makes it gradient ascent.
@uk_with_jatin3512
Жыл бұрын
@@tommyholladay totally agreed, but an easier statement to explain this would be that we are taking the negative of loss likelihood because for high values, we want to proceed towards that direction in our algorithm, so, we user the negative to reverse the direction of gradient.
@chrcheel
2 жыл бұрын
Thanks a lot!
@darshank8748
2 жыл бұрын
Great Work !!!
@helloansuman
2 жыл бұрын
Amazing ❤️
@nguyenvandien8996
2 жыл бұрын
Hello, Amini. Why can't I see the slides of this video on the homepage?
@jamesgambrah58
Жыл бұрын
This is awesome, but how can some of us watching this recorded video on KZitem have the opportunity to practice with VISTA, is there any arrangement for us.
@AAmini
Жыл бұрын
Yes! VISTA is available to the public as well here: github.com/vista-simulator/vista Also checkout the VISTA related lab3 on the open source software labs for the class for examples.
@jamesgambrah58
Жыл бұрын
@@AAmini Thanks Prof., I will explore it, Data Science community will forever appreciate your contribution to the growth of the field.
@ahmedb2559
Жыл бұрын
Thank you !
@hassinijalil5533
Жыл бұрын
Hello I have a question, when we do the training, what data is used to train the agent? Is it the environment (Carla for exemple ) ? And can we transform the environment into images ? I hope to reply me sir i have a project in university . and thank you .
@khaoticttv6506
2 жыл бұрын
Hey, do you use a Mac or a windows machine with Ubuntu installed on it?
@MarkSimithraaratchy
2 жыл бұрын
Excellent lecture; thank you.
@theneumann7
Жыл бұрын
👏
@andreas.karatzas
2 жыл бұрын
Now, that's the good stuff!!!
@harmhoeks5996
2 жыл бұрын
Why Tesla has 1500 data labelers instead of reinforcement learning?
@RC-bm9mf
2 жыл бұрын
Because actual accidents are much costly.
@kellybrower301
Жыл бұрын
Hallucinate? 🤔😭
@zhihuiyuze
2 жыл бұрын
Starcraft 2!!!!
@buoyrina9669
2 жыл бұрын
Looking to it