Rohin Shah
TalkRL: The Reinforcement Learning Podcast - En podcast af Robin Ranjit Singh Chauhan
Kategorier:
DeepMind Research Scientist Dr. Rohin Shah on Value Alignment, Learning from Human feedback, Assistance paradigm, the BASALT MineRL competition, his Alignment Newsletter, and more!