sdpkjc / abcdrl
Modular Single-file Reinfocement Learning Algorithms Library
☆37Updated last year
Related projects ⓘ
Alternatives and complementary repositories for abcdrl
- ☆34Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 7 months ago
- ☆37Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆34Updated 2 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆12Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆21Updated last year
- A modular implementation of PPO, and soon hopefully other algorithms.☆26Updated 9 months ago
- Source files to replicate experiments in my ICLR 2022 paper.☆61Updated 4 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆57Updated 5 months ago
- a modular reinforcement learning library with JAX agents☆22Updated 11 months ago
- An Open-Ended Agentic Simulator☆22Updated 2 months ago
- ☆20Updated 6 months ago
- ☆21Updated 6 months ago
- A high-performance reinforcement learning library in jax specialized for robotic learning☆22Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- ☆23Updated 2 years ago
- ☆42Updated last year
- Docker containers of baseline agents for the Crafter environment☆28Updated 2 years ago
- Corax: Core RL in JAX☆34Updated 8 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- Scalable Opponent Shaping Experiments in JAX☆21Updated 6 months ago
- Benchmarked implementations of Offline RL Algorithms.☆64Updated 6 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆33Updated 8 months ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆24Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆14Updated 6 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆25Updated 4 months ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆15Updated last year
- ☆18Updated last year