bigrl-team / gear
A distributed GPU-centric experience replay system for large AI models.
☆16Updated last year
Alternatives and similar repositories for gear:
Users that are interested in gear are comparing it to the libraries listed below
- ☆18Updated 5 years ago
- A Really Scalable RL Framework to 10k+ CPUs☆23Updated 11 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆53Updated 6 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆24Updated 2 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- An unofficial implementation for online decision transformer☆39Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆51Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆42Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆45Updated 10 months ago
- ☆29Updated 2 years ago
- ☆11Updated 10 months ago
- ☆19Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- [ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.☆57Updated last year
- ☆19Updated 7 months ago
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆12Updated 2 weeks ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆23Updated last year
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆24Updated 2 years ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆46Updated 6 months ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆12Updated last year
- A2C is a special case of PPO!☆19Updated 2 years ago
- ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"☆32Updated last year