A distributed GPU-centric experience replay system for large AI models.
☆19Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for gear
Users that are interested in gear are comparing it to the libraries listed below
Sorting:
- ☆14Mar 5, 2024Updated 2 years ago
- ☆28Feb 17, 2026Updated 2 weeks ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- ☆17Dec 4, 2019Updated 6 years ago
- ☆145Jan 30, 2025Updated last year
- ☆30Aug 20, 2021Updated 4 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 3 months ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 4 years ago
- we're building an AI to play the board game Diplomacy!☆35Mar 27, 2022Updated 3 years ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆36Aug 29, 2025Updated 6 months ago
- ☆42Jan 9, 2024Updated 2 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 4 months ago
- Official code repository for the MICCAI 2025 paper "UltraRay: Introducing Full-Path Ray Tracing in Physics-Based Ultrasound Simulation"☆17Aug 13, 2025Updated 6 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆52Nov 22, 2025Updated 3 months ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- LLM Skirmish☆44Feb 3, 2026Updated last month
- Datacenter simulation toolkit for the OpenDC project☆10Aug 24, 2020Updated 5 years ago
- ☆14Mar 21, 2024Updated last year
- ☆11Jan 11, 2022Updated 4 years ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- Parallel Quantum Annealing☆10Jan 7, 2023Updated 3 years ago
- repository for notes and data from machine learning studies☆11Dec 16, 2019Updated 6 years ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆42May 8, 2024Updated last year
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- [ICLR 2025] UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP☆15Jun 20, 2025Updated 8 months ago
- Solutions to assignments in course- "Bitcoin and Cryptocurrency Technologies", offered by coursera, Princeton University☆11Jun 28, 2018Updated 7 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 3 years ago
- ☆16Feb 22, 2025Updated last year
- Simple repository for training small reasoning models☆49Feb 17, 2026Updated 2 weeks ago
- The official SALIENT system described in the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and P…☆40Jun 28, 2023Updated 2 years ago
- Overcooked human-AI experiment platform☆39Dec 21, 2023Updated 2 years ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆54Nov 22, 2025Updated 3 months ago
- Meta-RL Model-Based Algorithm☆43Apr 30, 2025Updated 10 months ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆43Jan 29, 2019Updated 7 years ago
- Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL☆46Oct 16, 2024Updated last year
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆12Mar 5, 2025Updated last year