bigrl-team/gear

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bigrl-team/gear)

bigrl-team / gear

A distributed GPU-centric experience replay system for large AI models.

☆19

Alternatives and similar repositories for gear

Users that are interested in gear are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
xihuai18 / arxiv-sanity-x
View on GitHub
☆32Apr 12, 2026Updated 3 months ago
Stanford-ILIAD / Diverse-Conventions
View on GitHub
Exploring techniques to generate diverse conventions in multi-agent settings
☆16Nov 14, 2023Updated 2 years ago
automl / mdp-playground
View on GitHub
A python package to design and debug RL agents.
☆34Apr 2, 2026Updated 3 months ago
sjtu-marl / DPT-Agent
View on GitHub
This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…
☆61Nov 22, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liyang619 / COLE-Platform
View on GitHub
Overcooked human-AI experiment platform
☆41Dec 21, 2023Updated 2 years ago
lanl / Parallel-Quantum-Annealing
View on GitHub
Parallel Quantum Annealing
☆10Jan 7, 2023Updated 3 years ago
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year
tomdbar / ecord
View on GitHub
Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".
☆13Jun 18, 2022Updated 4 years ago
sii-yingwen / rommeo
View on GitHub
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Dec 8, 2022Updated 3 years ago
xihuai18 / A2PO-ICLR2023
View on GitHub
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32Nov 22, 2025Updated 8 months ago
connorbybee / hoim
View on GitHub
☆14Nov 28, 2023Updated 2 years ago
morning9393 / Optimal-Baseline-for-Multi-agent-Policy-Gradients
View on GitHub
☆30Aug 20, 2021Updated 4 years ago
ying-wen / malib_deprecated
View on GitHub
A Multi-agent Learning Framework
☆62May 10, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
efJerryYang / chatgpt-cli
View on GitHub
A markdown-supported command-line interface tool that connects to ChatGPT using OpenAI's API key.
☆48May 29, 2023Updated 3 years ago
Yiminghh / VertexEntanglement
View on GitHub
☆17Apr 14, 2024Updated 2 years ago
GusLovesMath / Llama3_MacSilicon
View on GitHub
Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…
☆11May 4, 2024Updated 2 years ago
liuguoyou / Face-Swapping-GAN-Pytorch
View on GitHub
A Pytorch implemtentation of ICCV 2019 paper Face Swapping Gan (https://arxiv.org/abs/1908.05932)
☆20Nov 11, 2019Updated 6 years ago
ying-wen / time_series_prediction
View on GitHub
Time series prediction project for Information Retrieval and Data Mining(COMPGI15)
☆30Apr 16, 2016Updated 10 years ago
MassimoPerini / online-gnn-learning
View on GitHub
☆13Dec 16, 2021Updated 4 years ago
zuoxingdong / gym-recsys
View on GitHub
Customizable RecSys Simulator for OpenAI Gym
☆26Dec 7, 2021Updated 4 years ago
Shanghai-Digital-Brain-Laboratory / BDM-DB1
View on GitHub
A large-scale multi-modal pre-trained model
☆134Feb 7, 2023Updated 3 years ago
Natsu-Akatsuki / pybind11
View on GitHub
来记录一波 pybind11 实例~
☆18Nov 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
imperial-nsb / jbubble
View on GitHub
🫧 Differentiable microbubble dynamics in JAX.
☆15Jul 16, 2026Updated last week
ProsusAI / stack-eval
View on GitHub
Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288
☆20Oct 30, 2024Updated last year
UofT-EcoSystem / hfta
View on GitHub
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
☆32May 15, 2024Updated 2 years ago
divyahansg / RecurrentDPG
View on GitHub
CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)
☆10Jun 10, 2017Updated 9 years ago
HosnLS / Hierarchical-Language-Agent
View on GitHub
☆45Jan 9, 2024Updated 2 years ago
ldbc / data-sets-surf-repository
View on GitHub
☆16Feb 7, 2026Updated 5 months ago
PeterSH6 / MSPipe
View on GitHub
☆16Feb 20, 2024Updated 2 years ago
Gouet / QMIX-Starcraft
View on GitHub
☆17Dec 4, 2019Updated 6 years ago
happypu326 / CoCo-MILP
View on GitHub
This is the code of CoCo-MILP: Inter-Variable Contrastive and Intra-Constraint Competitive MILP Solution Prediction. AAAI 2026 Oral.
☆16May 13, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zeroxleo / HyperGT
View on GitHub
The implementation of ICASSP 2024 lecture presentation paper "Hypergraph Transformer for Semi-Supervised Classification"
☆21Nov 23, 2024Updated last year
pmocz / advectiondiffusion-jax
View on GitHub
Solve the advection diffusion equations looped into an optimization problem with JAX/autodiff
☆14May 8, 2025Updated last year
Shanghai-Digital-Brain-Laboratory / DB-Football
View on GitHub
A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
☆118Jan 16, 2024Updated 2 years ago
sdr2002 / RDPG-Biped
View on GitHub
Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains
☆12Oct 16, 2017Updated 8 years ago
matrl-project / matrl
View on GitHub
☆12Jan 30, 2021Updated 5 years ago
nshepperd / gumbel-rao-pytorch
View on GitHub
☆11Jul 25, 2021Updated 5 years ago
kaiwenw / JoinGym
View on GitHub
A lightweight RL environment for query optimization.
☆16Sep 13, 2024Updated last year