LLM Chess - evaluating Large Language Models' reasoning and instruction-following abilities by simulating chess games
☆103Jun 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for llm_chess
Users that are interested in llm_chess are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A data visualisation of a 100 responses when asking local LLMs to imagine a random person.☆24Nov 4, 2024Updated last year
- ☆14Dec 17, 2025Updated 6 months ago
- Python package for pairwise ranking☆15Oct 17, 2024Updated last year
- Benchmark LLM reasoning capability by solving chess puzzles.☆91Apr 26, 2025Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An implementation of the Elo rating system with a sklearn interface.☆17Dec 2, 2022Updated 3 years ago
- State-space model perspective on rating systems (pairwise comparisons).☆21Feb 13, 2025Updated last year
- Compare openresty vs nginx + PUC_lua☆19Nov 3, 2023Updated 2 years ago
- ☆13Dec 20, 2019Updated 6 years ago
- Extensions to the Elo algorithm implemented in JAX☆14Jan 1, 2023Updated 3 years ago
- Implementing ASA's Win Probability Model☆18May 11, 2022Updated 4 years ago
- Toy O☆16Sep 21, 2024Updated last year
- Proceedings of the annual intercalary robot dance party in celebration of workshop on symposium about 2^6th birthdays; in particular, tha…☆21May 10, 2026Updated last month
- A concise list of CLI coding tools similar to Claude Code☆42Apr 13, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆11Oct 17, 2024Updated last year
- An artificial life experiment.☆12Jul 30, 2020Updated 5 years ago
- EARL - Extensible Attention-based Rocket League model☆18Oct 8, 2023Updated 2 years ago
- FIDE Data Pull☆16Jan 29, 2022Updated 4 years ago
- Tools for basic array manipulation and help dealing with the different flavors of arrays in Julia☆12Updated this week
- Maia-2 is a new human-like neural network chess engine trained on millions of human games.☆144Mar 8, 2026Updated 3 months ago
- Papers that use Lichess data, study Lichess, or cite Lichess☆36Updated this week
- The first large scale formally verified reasoning dataset for Verilog☆21May 16, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A python package for rating systems (like Elo)☆30Updated this week
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang☆44Nov 19, 2025Updated 7 months ago
- ☆13Nov 1, 2023Updated 2 years ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Elrond NFT minting platform POC (Also check out: www.elven.tools)☆12Aug 10, 2023Updated 2 years ago
- An installer tool for petals decentralized text generation network☆11Oct 1, 2023Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- ☆10Jun 26, 2024Updated last year
- a python convertion from the ruby implementation of Rémi Coulom's Whole-History Rating (WHR) algorithm.☆69Feb 10, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Jul 7, 2024Updated last year
- ☆19Apr 10, 2025Updated last year
- Improving Token-Based World Models with Parallel Observation Prediction (ICML 2024)☆14Feb 23, 2026Updated 3 months ago
- ☆12Apr 3, 2024Updated 2 years ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated last year
- KV cache compression via sparse coding☆18Oct 26, 2025Updated 7 months ago
- This repo contains the code for the paper "Object-cropping for SSL".☆18Feb 14, 2023Updated 3 years ago