(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
☆138Oct 26, 2023Updated 2 years ago
Alternatives and similar repositories for ChessGPT
Users that are interested in ChessGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆287May 26, 2024Updated last year
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆24Oct 4, 2023Updated 2 years ago
- ☆12Apr 17, 2024Updated 2 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- ☆26May 30, 2023Updated 2 years ago
- ☆10Apr 23, 2021Updated 5 years ago
- Fork of https://github.com/lichess-org/chessground Supports boards up to 16x16☆46Mar 25, 2026Updated last month
- ☆100Jun 12, 2024Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆200Mar 7, 2025Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Usage examples for chessground☆62Mar 6, 2026Updated 2 months ago
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".☆22Feb 20, 2023Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- Directed masked autoencoders☆14Mar 25, 2026Updated last month
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- ☆14Dec 28, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Oct 7, 2025Updated 7 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆73May 15, 2023Updated 2 years ago
- Reinforced Multi-LLM Agents training☆82Jan 18, 2026Updated 3 months ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆93Updated this week
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 9 months ago
- ☆19Apr 26, 2026Updated last week
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆112Apr 17, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆51Sep 19, 2025Updated 7 months ago
- Official repository for the paper "Automating Continual Learning"☆18Jun 11, 2025Updated 10 months ago
- Code implementation of "Information Design in Multi-Agent Reinforcement Learning"☆15Aug 18, 2023Updated 2 years ago
- The implementation of "Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4"☆161Nov 8, 2023Updated 2 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 3 years ago
- This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.☆51Dec 4, 2023Updated 2 years ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆69Sep 6, 2024Updated last year