(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
☆141Oct 26, 2023Updated 2 years ago
Alternatives and similar repositories for ChessGPT
Users that are interested in ChessGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- Neural network-based chess engine capable of natural language commentary☆530Nov 15, 2022Updated 3 years ago
- UCI chess engine☆11Jun 6, 2026Updated 3 weeks ago
- Embedding based chess position search and embedding learning for chess positions☆18May 23, 2026Updated last month
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆287May 26, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- A tool for lc0 training data operations☆29May 5, 2024Updated 2 years ago
- ☆12Apr 17, 2024Updated 2 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- ☆26May 30, 2023Updated 3 years ago
- A Decision Support System (DSS) based on the Graph Model for Conflict Resolution (GMCR).☆15Apr 4, 2020Updated 6 years ago
- (AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)☆12May 22, 2023Updated 3 years ago
- ☆60Apr 22, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆12Jan 30, 2021Updated 5 years ago
- ☆102Jun 12, 2024Updated 2 years ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆204Mar 7, 2025Updated last year
- Utilities for experimenting with leela-chess-zero☆41Mar 3, 2022Updated 4 years ago
- Bipedal Skills Benchmark for Reinforcement Learning☆26Oct 27, 2022Updated 3 years ago
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".☆22Feb 20, 2023Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 3 years ago
- General-Sum variant of the game Diplomacy for evaluating AIs.☆36Apr 2, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Directed masked autoencoders☆15Mar 25, 2026Updated 3 months ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- [AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".☆21Jul 26, 2025Updated 11 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- AobaKomaochi. Shogi komaochi (handicap game) Deep reinforcement learning.☆15Jan 12, 2025Updated last year
- Un-*** 50 billions multimodality dataset☆24Sep 14, 2022Updated 3 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Oct 7, 2025Updated 8 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆73May 15, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆18Nov 16, 2020Updated 5 years ago
- Reinforced Multi-LLM Agents training☆86Jan 18, 2026Updated 5 months ago
- ☆21Mar 19, 2024Updated 2 years ago
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- Natural Language Reinforcement Learning☆101Jul 30, 2025Updated 10 months ago
- Simple ChatGPT interface for shell and macOS Alfred workflow☆13Oct 3, 2025Updated 8 months ago
- Code for paper "Personalized Counterfactual Fairness in Recommendation" (a.k.a. "Towards Personalized Fairness based on Causal Notion")☆18Nov 5, 2021Updated 4 years ago