waterhorse1 / ChessGPT
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
☆97 Updated last year
Related projects
Alternatives and complementary repositories for ChessGPT
- ☆73 Updated 4 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆89 Updated this week
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆102 Updated 7 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆77 Updated 2 weeks ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text ☆219 Updated 2 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. … ☆120 Updated 6 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game. ☆91 Updated last month
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆86 Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆45 Updated 5 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆118 Updated last year
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments" ☆32 Updated last week
- BASALT Benchmark datasets, evaluation code and agent training example. ☆19 Updated 11 months ago
- Implementation of TWOSOME ☆47 Updated 6 months ago
- Code for Contrastive Preference Learning (CPL) ☆153 Updated 8 months ago
- Rewarded soups official implementation ☆49 Updated last year
- A minimal home grid world environment to evaluate language understanding in interactive agents. ☆20 Updated last year
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction" ☆42 Updated last year
- ☆11 Updated 7 months ago
- ☆64 Updated this week
- Minimal but scalable implementation of large language models in JAX ☆25 Updated last week
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 Updated last month
- fast + parallel AlphaZero in JAX ☆84 Updated 7 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum. ☆213 Updated 3 weeks ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆195 Updated this week
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code ☆26 Updated 2 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 Updated 2 months ago
- Interpreting how transformers simulate agents performing RL tasks ☆69 Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆93 Updated 3 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆150 Updated last year
- Bootstrapping ARC ☆38 Updated this week