waterhorse1/ChessGPT

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

☆142

Alternatives and similar repositories for ChessGPT

Users who are interested in ChessGPT are comparing it to the repositories listed below.

waterhorse1 / NAC
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28 · Nov 19, 2021 · Updated 4 years ago
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆13 · Nov 3, 2021 · Updated 4 years ago
waterhorse1 / LLM_Tree_Search
(ICML 2024) AlphaZero-like Tree-Search can guide large language model decoding and training
☆287 · May 26, 2024 · Updated 2 years ago
hch1017 / TH_LLM
☆20 · Jan 7, 2024 · Updated 2 years ago
godmoves / reinforcement_learning_collections
A collection of deep reinforcement learning algorithm implementations
☆11 · Jan 9, 2020 · Updated 6 years ago
onp / gmcr-py
A Decision Support System (DSS) based on the Graph Model for Conflict Resolution (GMCR).
☆15 · Apr 4, 2020 · Updated 6 years ago
ctlllll / reward_collapse
☆26 · May 30, 2023 · Updated 3 years ago
npvoid / OnlineDoubleOracle
☆10 · Apr 23, 2021 · Updated 5 years ago
kennyderek / adap
Adaptable Agent Populations via a Generative Model of Policies
☆12 · Oct 14, 2021 · Updated 4 years ago
mindagent / mindagent
☆102 · Jun 12, 2024 · Updated 2 years ago
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆207 · Mar 7, 2025 · Updated last year
google-deepmind / diplomacy
☆60 · Apr 22, 2024 · Updated 2 years ago
thu-rllab / SOG
Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agent Reinforcement Learning".
☆22 · Feb 20, 2023 · Updated 3 years ago
logical-intelligence / proofs
☆23 · Dec 3, 2025 · Updated 7 months ago
FranxYao / GPT-Bargaining
Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
☆206 · May 24, 2023 · Updated 3 years ago
mukobi / welfare-diplomacy
General-sum variant of the game Diplomacy for evaluating AIs.
☆36 · Apr 2, 2024 · Updated 2 years ago
facebookresearch / dmae_st
Directed masked autoencoders
☆14 · Mar 25, 2026 · Updated 3 months ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 · Jul 12, 2023 · Updated 3 years ago
LAION-AI / laion50BU
Un-*** 50 billions multimodality dataset
☆24 · Sep 14, 2022 · Updated 3 years ago
NingMiao / InstaAug
☆15 · Dec 28, 2022 · Updated 3 years ago
evgenii-nikishin / omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43 · Jun 14, 2021 · Updated 5 years ago
google-deepmind / pushworld
PushWorld: A benchmark for manipulation planning with tools and movable obstacles
☆95 · May 5, 2026 · Updated 2 months ago
zuzuba / CISR_NeurIPS20
☆18 · Nov 16, 2020 · Updated 5 years ago
LiangZhang1996 / DataLight-old
Code for "Data Might be Enough: Bridge Real-World Traffic Signal Control Using Offline Reinforcement Learning"
☆11 · May 2, 2024 · Updated 2 years ago
BladeTransformerLLC / OvercookedGPT
An OpenAI gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…
☆73 · May 15, 2023 · Updated 3 years ago
YuYang0901 / CLIP-spurious-finetune
Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)
☆19 · Dec 15, 2023 · Updated 2 years ago
waterhorse1 / Natural-language-RL
Natural Language Reinforcement Learning
☆101 · Jul 30, 2025 · Updated 11 months ago
djsutherland / vlfeat-ctypes
A ctypes interface to a (very small) subset of vlfeat.
☆21 · Apr 9, 2019 · Updated 7 years ago
AnonymousIDforSubmission / GESA
☆15 · Dec 13, 2022 · Updated 3 years ago
HumanCompatibleAI / human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
☆112 · Apr 17, 2023 · Updated 3 years ago
aicenter / openspiel_reproductions
Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works
☆18 · Mar 2, 2021 · Updated 5 years ago
davidbrandfonbrener / imitation_pretraining
☆20 · May 30, 2023 · Updated 3 years ago
Ali-Omrani / CCR
Conceptual Construct Representations
☆11 · Feb 23, 2023 · Updated 3 years ago
hsvgbkhgbv / shapley-q-learning
This repo is the implementation of the paper "SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning".
☆52 · Dec 4, 2023 · Updated 2 years ago
CR-Gjx / Suspicion-Agent
The implementation of "Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4"
☆166 · Nov 8, 2023 · Updated 2 years ago
princeton-nlp / SRL-NLC
Safe Reinforcement Learning with Natural Language Constraints
☆17 · Oct 24, 2021 · Updated 4 years ago
ssbc / private-group-spec
☆15 · Nov 7, 2023 · Updated 2 years ago
zhiyunfan / SEQ-SCD
☆16 · Apr 21, 2022 · Updated 4 years ago
IBM / grammar2pddl
Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…
☆19 · Sep 17, 2025 · Updated 10 months ago