waterhorse1 / ChessGPT
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
☆107Updated last year
Alternatives and similar repositories for ChessGPT:
Users that are interested in ChessGPT are comparing it to the libraries listed below
- ☆76Updated 6 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆125Updated 9 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆127Updated 9 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆93Updated this week
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆100Updated 4 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆209Updated 2 months ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆69Updated last month
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆90Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆114Updated 2 months ago
- ☆209Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆127Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- fast + parallel AlphaZero in JAX☆90Updated last month
- Scaling scaling laws with board games.☆45Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆233Updated 5 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆135Updated 2 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆108Updated this week
- ☆140Updated 8 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆64Updated last year
- BASALT Benchmark datasets, evaluation code and agent training example.☆20Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆47Updated 7 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆89Updated 4 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆203Updated last year
- ☆22Updated last year
- Play chess against large language models.☆42Updated 11 months ago
- ☆69Updated last year
- ☆13Updated 10 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆157Updated last year
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- Implementation of TWOSOME☆62Updated 2 weeks ago