Aravind Srinivas
Aravind Srinivas is a third-year PhD student at UC Berkeley, advised by Prof. Pieter Abbeel.
He co-created and co-taught a grad course on Deep Unsupervised Learning at Berkeley.
Featured References
Data-Efficient Image Recognition with Contrastive Predictive Coding
Olivier J. Hénaff, Aravind Srinivas, Jeffrey De Fauw, Ali Razavi, Carl Doersch, S. M. Ali Eslami, Aaron van den Oord
Contrastive Unsupervised Representations for Reinforcement Learning
Aravind Srinivas, Michael Laskin, Pieter Abbeel
Reinforcement Learning with Augmented Data
Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee, Michael Laskin, Aravind Srinivas, Pieter Abbeel
Additional References
- CS294-158-SP20 Deep Unsupervised Learning, Berkeley
- Phasic Policy Gradient, Karl Cobbe, Jacob Hilton, Oleg Klimov, John Schulman
- Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning, Grill et al. 2020