zhixuan-lin / cs285-fall-2019
My solutions to CS285 2019 Fall of UC Berkeley
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for cs285-fall-2019
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆59Updated last month
- Continual Reinforcement Learning in 3D Non-stationary Environments☆35Updated 5 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆34Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆42Updated last year
- Berkeley CS285 2019 homework solution☆30Updated last year
- Guided-Meta Policy Search☆41Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆36Updated 3 weeks ago
- A repository for code of reinforcement learning algorithms with PyTorch☆29Updated 3 years ago
- implementation of our self-guided and self-regularized actor-critic algorithm☆30Updated last year
- Pytorch starter code for UC Berkeley's cs285 assignments☆70Updated 2 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- ☆28Updated 3 years ago
- Comp 781 Project☆8Updated 5 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆36Updated 2 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 4 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 3 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆15Updated 6 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- Generalised UDRL☆37Updated 2 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆19Updated 4 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"☆46Updated 5 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆52Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Updated 6 years ago
- ☆53Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Great resources for learning optimal control☆17Updated 5 years ago