waterhorse1 / ChessGPTView external linksLinks
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
☆131Oct 26, 2023Updated 2 years ago
Alternatives and similar repositories for ChessGPT
Users that are interested in ChessGPT are comparing it to the libraries listed below
Sorting:
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- Un-*** 50 billions multimodality dataset☆23Sep 14, 2022Updated 3 years ago
- ☆26May 30, 2023Updated 2 years ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- ☆12Jan 30, 2021Updated 5 years ago
- ☆20Dec 3, 2025Updated 2 months ago
- ☆13Nov 20, 2023Updated 2 years ago
- A Decision Support System (DSS) based on the Graph Model for Conflict Resolution (GMCR).☆15Apr 4, 2020Updated 5 years ago
- ☆11Apr 23, 2021Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆285May 26, 2024Updated last year
- (AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)☆12May 22, 2023Updated 2 years ago
- ☆61Apr 22, 2024Updated last year
- ☆18Jul 24, 2023Updated 2 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Oct 7, 2025Updated 4 months ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆61Aug 27, 2022Updated 3 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 4 months ago
- Simple notebooks to learn diffusion models on toy datasets☆17Feb 9, 2023Updated 3 years ago
- Stress test for parallel disk i/o using git and pnpm☆29Oct 14, 2025Updated 4 months ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- A ctypes interface to a (very small) subset of vlfeat.☆21Apr 9, 2019Updated 6 years ago
- Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…☆18Sep 17, 2025Updated 4 months ago
- Code release of paper "ForkMerge: Mitigating Negative Transfer in Auxiliary-Task Learning" (NeurIPS 2023)☆17Dec 30, 2023Updated 2 years ago
- ☆16Mar 22, 2024Updated last year
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 3 years ago
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Apr 17, 2023Updated 2 years ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Mar 7, 2025Updated 11 months ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- ☆18May 30, 2023Updated 2 years ago
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".☆21Feb 20, 2023Updated 2 years ago
- ☆20Mar 19, 2024Updated last year
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆22Jul 6, 2023Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- Bipedal Skills Benchmark for Reinforcement Learning☆25Oct 27, 2022Updated 3 years ago
- ☆18Nov 16, 2020Updated 5 years ago