qgallouedec / deep_rl
Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.
☆22Updated 2 years ago
Alternatives and similar repositories for deep_rl:
Users that are interested in deep_rl are comparing it to the libraries listed below
- Source files to replicate experiments in my ICLR 2022 paper.☆67Updated 7 months ago
- ☆41Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆64Updated 8 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated last year
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆12Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 10 months ago
- EARL: Environment for Autonomous Reinforcement Learning☆36Updated 2 years ago
- ☆23Updated 2 years ago
- ☆23Updated 9 months ago
- ☆48Updated 2 years ago
- ☆35Updated 2 years ago
- Model-based Policy Gradients☆30Updated 4 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- ☆17Updated 11 months ago
- ☆15Updated 10 months ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 2 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆22Updated 3 months ago
- a modular reinforcement learning library with JAX agents☆22Updated last year
- A high-performance reinforcement learning library in jax specialized for robotic learning☆22Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆26Updated last year
- ☆31Updated 11 months ago
- A modular implementation of PPO, and soon hopefully other algorithms.☆26Updated last year
- ☆21Updated 2 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆14Updated 9 months ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆104Updated last year