ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆24Updated 11 months ago
Alternatives and similar repositories for pax:
Users that are interested in pax are comparing it to the libraries listed below
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- POPGym Library in JAX☆11Updated 11 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 4 months ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆13Updated 9 months ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆25Updated 8 months ago
- ☆20Updated 9 months ago
- ☆13Updated 8 months ago
- ☆35Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Sandbox environment for generalizable agent research☆24Updated 2 years ago
- Reinforcement Learning inside a 3D soccer simulation☆25Updated 6 months ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 9 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- Synchronized Curriculum Learning for RL Agents☆41Updated this week
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- ☆40Updated 3 years ago
- ☆31Updated last year
- ☆41Updated last year
- ☆30Updated 4 years ago
- Code for magnetic mirror descent.☆15Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆12Updated 10 months ago
- ☆43Updated last year