bigrl-team / gear
A distributed GPU-centric experience replay system for large AI models.
☆16Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gear
- ☆18Updated 5 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆49Updated last year
- A Really Scalable RL Framework to 10k+ CPUs☆17Updated 8 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆48Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- ☆22Updated 10 months ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆9Updated 4 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- ☆28Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 4 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆23Updated last year
- Code repo for ICML'23 Searching Large Neighborhoods for Integer Linear Programs with Contrastive Learning☆35Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆53Updated 9 months ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- ☆21Updated 4 years ago
- ☆34Updated last year
- ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"☆30Updated 11 months ago
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- ☆37Updated 2 years ago
- Launch programs on multiple hosts. (多机启动程序)☆14Updated last year
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆33Updated 3 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆24Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆25Updated 4 months ago