maxim-saplin / llm_chessLinks
LLM Chess - Large Language Models Competing in Chess
☆67Updated 3 weeks ago
Alternatives and similar repositories for llm_chess
Users that are interested in llm_chess are comparing it to the libraries listed below
Sorting:
- Benchmarking Agentic LLM and VLM Reasoning On Games☆201Updated 2 months ago
- ☆124Updated 9 months ago
- ☆167Updated 9 months ago
- A repo to evaluate various LLM's chess playing abilities.☆82Updated last year
- SoTA Approach for ARC-AGI 2☆103Updated last month
- ☆137Updated 2 months ago
- Draw more samples☆194Updated last year
- ☆193Updated 2 months ago
- ☆491Updated 4 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 8 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 9 months ago
- ☆104Updated this week
- Open source interpretability artefacts for R1.☆161Updated 5 months ago
- ☆93Updated 4 months ago
- Testing baseline LLMs performance across various models☆316Updated last week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆330Updated 11 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆286Updated last week
- Our solution for the arc challenge 2024☆180Updated 4 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆44Updated 6 months ago
- Domain Specific Language for the Abstraction and Reasoning Corpus☆300Updated last year
- GRadient-INformed MoE☆264Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆312Updated 3 months ago
- smol models are fun too☆93Updated 11 months ago
- ☆163Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆325Updated 11 months ago
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- Bootstrapping ARC☆142Updated 10 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆64Updated 8 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆56Updated 5 months ago