ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆21 Updated 6 months ago
Related projects
Alternatives and complementary repositories for pax
- An Open-Ended Agentic Simulator ☆22 Updated 2 months ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆12 Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022) ☆16 Updated last year
- ☆17 Updated 4 months ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 2 years ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆11 Updated last week
- ☆61 Updated 2 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆14 Updated last week
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆40 Updated last week
- ☆12 Updated 3 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 Updated 7 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆21 Updated last year
- Reinforcement Learning inside a 3D soccer simulation ☆24 Updated last month
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆52 Updated 9 months ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 Updated last year
- ☆34 Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆31 Updated 4 years ago
- Accelerated replay buffers in JAX ☆39 Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 Updated 3 months ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆26 Updated 2 years ago
- Reinforcement learning on general 2D physics environments in JAX ☆10 Updated this week
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆21 Updated 6 months ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 5 months ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆43 Updated 2 years ago
- ☆28 Updated last year
- Vectorization techniques for fast population-based training. ☆54 Updated 2 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆12 Updated 4 months ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des… ☆23 Updated 4 months ago