luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆9 Updated last year
Related projects
Alternatives and complementary repositories for discovered-policy-optimisation
- ☆17 Updated 5 months ago
- An Open-Ended Agentic Simulator ☆28 Updated 3 months ago
- Scalable Opponent Shaping Experiments in JAX ☆21 Updated 7 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆13 Updated 3 weeks ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des… ☆22 Updated 4 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 Updated 7 months ago
- ☆13 Updated 4 months ago
- ☆63 Updated 3 months ago
- POPGym Library in JAX ☆11 Updated 7 months ago
- A collection of matrix games in JAX ☆10 Updated 2 weeks ago
- Reinforcement Learning inside a 3D soccer simulation ☆25 Updated 2 months ago
- Accelerated replay buffers in JAX ☆40 Updated 2 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆12 Updated 5 months ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆13 Updated 2 years ago
- Simple JAX Graphics Library. ☆23 Updated 2 weeks ago
- Highly scalable 2D JAX physics engine. ☆37 Updated last week
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆15 Updated 3 weeks ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 Updated last year
- Corax: Core RL in JAX ☆35 Updated 9 months ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers ☆47 Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆40 Updated last week
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆63 Updated 3 months ago
- ☆29 Updated 8 months ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning" ☆18 Updated 2 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆49 Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆29 Updated 3 months ago
- ☆20 Updated 6 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆63 Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function ☆13 Updated 2 years ago