ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆24Updated last year
Alternatives and similar repositories for pax:
Users that are interested in pax are comparing it to the libraries listed below
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- POPGym Library in JAX☆11Updated last year
- ☆20Updated 9 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆30Updated 4 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 5 months ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 10 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆54Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆53Updated last year
- ☆44Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆13Updated 9 months ago
- ☆31Updated last year
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆13Updated 4 months ago
- ☆35Updated 2 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- ☆31Updated 2 years ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆32Updated 8 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 5 months ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆26Updated last year