skezle / owl
☆16 · Updated 3 years ago
Alternatives and similar repositories for owl
Users who are interested in owl are comparing it to the libraries listed below.
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆28 · Updated 3 years ago
- ☆16 · Updated 3 years ago
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext… ☆124 · Updated 2 years ago
- ☆100 · Updated last year
- Code for FOCAL Paper Published at ICLR 2021 ☆51 · Updated last year
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021) ☆33 · Updated 2 years ago
- ☆16 · Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆47 · Updated 2 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆55 · Updated 4 years ago
- ☆54 · Updated last year
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline" ☆33 · Updated 2 years ago
- ☆57 · Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆59 · Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 · Updated 2 years ago
- ☆26 · Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆51 · Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆35 · Updated 2 years ago
- ☆31 · Updated 2 years ago
- Model-Based Offline Reinforcement Learning ☆51 · Updated 4 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning ☆20 · Updated 2 years ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments" ☆11 · Updated 5 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) ☆192 · Updated 2 years ago
- ☆32 · Updated 4 years ago
- Conservative Q Learning on top of SAC ☆132 · Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆48 · Updated 4 years ago
- ☆14 · Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 · Updated 3 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7 ☆20 · Updated 6 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022 ☆64 · Updated 2 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning" ☆18 · Updated 2 years ago