A distributed GPU-centric experience replay system for large AI models.
☆19Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for gear
Users that are interested in gear are comparing it to the libraries listed below.
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- A python package to design and debug RL agents.☆33Apr 2, 2026Updated 2 months ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆61Nov 22, 2025Updated 6 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- ☆144Jan 30, 2025Updated last year
- ☆10Feb 22, 2023Updated 3 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- ☆45Jan 9, 2024Updated 2 years ago
- we're building an AI to play the board game Diplomacy!☆35Mar 27, 2022Updated 4 years ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 5 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11May 4, 2024Updated 2 years ago
- ☆17Apr 14, 2024Updated 2 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆13May 10, 2021Updated 5 years ago
- The official SALIENT system described in the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and P…☆40Jun 28, 2023Updated 2 years ago
- Customizable RecSys Simulator for OpenAI Gym☆26Dec 7, 2021Updated 4 years ago
- Some scripts to turn an OpenWrt router into a passive find3 scanner☆26Oct 11, 2020Updated 5 years ago
- Official code repository for the MICCAI 2025 paper "UltraRay: Introducing Full-Path Ray Tracing in Physics-Based Ultrasound Simulation"☆23Aug 13, 2025Updated 10 months ago
- A large-scale multi-modal pre-trained model☆134Feb 7, 2023Updated 3 years ago
- The repository for 'Unsupervised Learning for Combinatorial Optimization with Principled Proxy Design'☆16Oct 9, 2022Updated 3 years ago
- ☆16Feb 7, 2026Updated 4 months ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 9 years ago
- ☆16Feb 20, 2024Updated 2 years ago
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆55Apr 20, 2026Updated last month
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288☆20Oct 30, 2024Updated last year
- Official repository for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- Solve the advection diffusion equations looped into an optimization problem with JAX/autodiff☆14May 8, 2025Updated last year
- ☆10Apr 23, 2021Updated 5 years ago
- A lightweight RL environment for query optimization.☆16Sep 13, 2024Updated last year
- ☆16Jul 29, 2025Updated 10 months ago
- ☆12Jan 30, 2021Updated 5 years ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆116Jan 16, 2024Updated 2 years ago
- Official implementation of TBA for async LLM post-training.☆31Nov 5, 2025Updated 7 months ago
- ☆17Dec 4, 2019Updated 6 years ago
- Stochastic Gradient MCMC for Jax☆19May 19, 2025Updated last year
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆13Mar 5, 2025Updated last year
- ☆16Jul 13, 2022Updated 3 years ago
- CookingZoo: a gym-cooking derivative to simulate a complex cooking environment☆22Dec 6, 2024Updated last year