From search engines to science to robotics, this repository is meant to showcase the use of reinforcement learning in the world.
☆282Jun 16, 2024Updated last year
Alternatives and similar repositories for DeepRLInTheWorld
Users that are interested in DeepRLInTheWorld are comparing it to the libraries listed below
- Official Code for "Relative Entropy Pathwise Policy Optimization"☆46Feb 27, 2026Updated last week
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Updated this week
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.☆867Aug 12, 2024Updated last year
- Library for Model Based RL☆1,054Jul 12, 2024Updated last year
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆810Dec 1, 2025Updated 3 months ago
- code for CoRL 2020 paper "Contrastive Variational Model-Based Reinforcement Learning for Complex Observations"☆24Dec 29, 2021Updated 4 years ago
- RL Environments in JAX 🌍☆868May 30, 2025Updated 9 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆37May 19, 2023Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆753Oct 26, 2022Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆164Jun 23, 2023Updated 2 years ago
- High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T…☆9,213Jul 8, 2025Updated 8 months ago
- ☆20May 22, 2022Updated 3 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- Simple JAX Graphics Library.☆36Nov 3, 2024Updated last year
- Really Fast End-to-End Jax RL Implementations☆1,028Sep 9, 2024Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆205Jun 18, 2024Updated last year
- Gym env for Slay the Spire☆17Dec 31, 2024Updated last year
- ☆13Jun 3, 2022Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Mar 9, 2023Updated 3 years ago
- Simple and easily configurable grid world environments for reinforcement learning☆2,407Mar 2, 2026Updated last week
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Jul 5, 2023Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Jax/Flax Implementation of TD-MPC2☆72Jan 12, 2026Updated last month
- ☆329Dec 19, 2024Updated last year
- ☆59Sep 22, 2022Updated 3 years ago
- A collection of reference environments for offline reinforcement learning☆1,656Nov 18, 2024Updated last year
- JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️☆325Dec 16, 2025Updated 2 months ago
- [ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"☆233Dec 27, 2022Updated 3 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆51Sep 30, 2022Updated 3 years ago
- DrQ-v2: Improved Data-Augmented Reinforcement Learning☆431May 31, 2022Updated 3 years ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,274Aug 12, 2024Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆138Aug 20, 2024Updated last year