Metta-AI / metta
A reinforcement learning codebase focusing on the emergence of cooperation and alignment in multi-agent AI systems.
☆33Updated this week
Alternatives and similar repositories for metta
Users that are interested in metta are comparing it to the libraries listed below
Sorting:
- An Open-Ended Agentic Simulator☆49Updated 9 months ago
- ☆77Updated last month
- Simple single-file baselines for Q-Learning in pure-GPU setting☆161Updated last month
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆22Updated 3 weeks ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆97Updated 6 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆100Updated last year
- Efficient baselines for autocurricula in JAX.☆188Updated 8 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆91Updated last month
- ☆79Updated 6 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆71Updated 3 weeks ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆49Updated 4 months ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆17Updated 2 months ago
- ☆21Updated 8 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆51Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆55Updated 2 years ago
- General Modules for JAX☆65Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago
- ☆19Updated this week
- Evaluating long-term memory of reinforcement learning algorithms☆142Updated last year
- Object Centric Atari games☆78Updated this week
- A collection of matrix games in JAX☆11Updated 5 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆119Updated 3 weeks ago
- Exploitability calculation for imperfect-information game benchmarks☆24Updated last month
- Accelerated minigrid environments with JAX☆135Updated this week