Chuyển sang chế độ ngoại tuyến với ứng dụng Player FM !
Shimon Whiteson
Manage episode 279413376 series 2536330
Shimon Whiteson is a Professor of Computer Science at Oxford University, the head of WhiRL, the Whiteson Research Lab at Oxford, and Head of Research at Waymo UK.
Featured References
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Luisa Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson
Additional References
- Shimon Whiteson - Multi-agent RL, MIT Embodied Intelligence Seminar
- The StarCraft Multi-Agent Challenge, Samvelyan et al 2019
- Direct Policy Transfer with Hidden Parameter Markov Decision Processes, Yao et al 2018
- Value-Decomposition Networks For Cooperative Multi-Agent Learning, Sunehag et al 2017
- Whiteson Research Lab
- Waymo acquires Latent Logic to accelerate progress towards safe, driverless vehicles, Oxford News
- Waymo
61 tập
Manage episode 279413376 series 2536330
Shimon Whiteson is a Professor of Computer Science at Oxford University, the head of WhiRL, the Whiteson Research Lab at Oxford, and Head of Research at Waymo UK.
Featured References
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Luisa Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson
Additional References
- Shimon Whiteson - Multi-agent RL, MIT Embodied Intelligence Seminar
- The StarCraft Multi-Agent Challenge, Samvelyan et al 2019
- Direct Policy Transfer with Hidden Parameter Markov Decision Processes, Yao et al 2018
- Value-Decomposition Networks For Cooperative Multi-Agent Learning, Sunehag et al 2017
- Whiteson Research Lab
- Waymo acquires Latent Logic to accelerate progress towards safe, driverless vehicles, Oxford News
- Waymo
61 tập
Tất cả các tập
×Chào mừng bạn đến với Player FM!
Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.