bigrl-team / gear
A distributed GPU-centric experience replay system for large AI models.
☆17 Updated last year
Alternatives and similar repositories for gear:
Users who are interested in gear are comparing it to the libraries listed below.
- A Really Scalable RL Framework to 10k+ CPUs ☆31 Updated last year
- ☆18 Updated 6 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function ☆13 Updated 2 years ago
- Representation Learning in RL ☆16 Updated 2 years ago
- ☆19 Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 Updated 2 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond. ☆45 Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 5 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration” ☆24 Updated 2 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained) ☆25 Updated 5 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads ☆43 Updated 4 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆118 Updated 3 years ago
- ☆11 Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); a football game agent ☆52 Updated last year
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning ☆9 Updated 2 months ago
- Extending the Neural Graph Algorithm Executor ☆13 Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT. ☆29 Updated 9 months ago
- An unofficial implementation for online decision transformer ☆40 Updated 2 years ago
- Code for Neural Execution Engines: Learning to Execute Subroutines ☆17 Updated 4 years ago
- PyTorch implementation of MuZero Unplugged for gym environments. This algorithm is capable of supporting a wide range of action and observ… ☆27 Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆21 Updated 4 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings ☆14 Updated last year
- ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates" ☆33 Updated last year
- ICLR'22 Programmatic Reinforcement Learning ☆16 Updated 2 years ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research. ☆46 Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆43 Updated 2 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms. ☆69 Updated 2 years ago
- WIP implementation of https://arxiv.org/pdf/1901.08162.pdf ☆9 Updated 5 years ago
- Winner of NeurIPS 2021 student leaderboard. Self-bootstrapping Bayesian optimization for SCIP configuration using GNNs. ☆13 Updated 2 years ago