maxim-saplin / llm_chess
LLM Chess - Large Language Models Competing in Chess
☆39Updated this week
Alternatives and similar repositories for llm_chess:
Users that are interested in llm_chess are comparing it to the libraries listed below
- A virtual environment for developing and evaluating automated scientific discovery agents.☆144Updated last month
- ☆84Updated last week
- Benchmarking Agentic LLM and VLM Reasoning On Games☆129Updated 2 weeks ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆171Updated 3 months ago
- A repo to evaluate various LLM's chess playing abilities.☆81Updated last year
- ☆108Updated 4 months ago
- ☆143Updated 2 weeks ago
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 2 months ago
- ☆53Updated 2 months ago
- Train your own SOTA deductive reasoning model☆88Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- ☆106Updated 4 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆53Updated 8 months ago
- Draw more samples☆189Updated 10 months ago
- ☆114Updated 2 months ago
- Our solution for the arc challenge 2024☆134Updated last month
- Clue inspired puzzles for testing LLM deduction abilities☆33Updated last month
- smolLM with Entropix sampler on pytorch☆151Updated 5 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆204Updated 5 months ago
- Benchmark LLM reasoning capability by solving chess puzzles.☆75Updated 11 months ago
- ☆166Updated last week
- ☆81Updated 3 weeks ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆139Updated this week
- A simplified implementation for experimenting with Reinforcement Learning (RL) on GSM8K, inspired by RLVR and Deepseek R1. This repositor…☆78Updated 2 months ago
- Official repository of the paper, PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon.☆52Updated 3 weeks ago
- EvaByte: Efficient Byte-level Language Models at Scale☆88Updated this week
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆187Updated 4 months ago
- Testing baseline LLMs performance across various models☆257Updated last week
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement☆85Updated 2 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆81Updated last month