kagisearch/llm-chess-puzzles

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kagisearch/llm-chess-puzzles)

kagisearch / llm-chess-puzzles

Benchmark LLM reasoning capability by solving chess puzzles.

☆91

Alternatives and similar repositories for llm-chess-puzzles

Users that are interested in llm-chess-puzzles are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

maxim-saplin / llm_chess
View on GitHub
LLM Chess - evaluating Large Language Models' reasoning and instruction-following abilities by simulating chess games
☆113Updated this week
alessiodevoto / l2compress
View on GitHub
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆19Dec 13, 2024Updated last year
kaistAI / factual-knowledge-acquisition
View on GitHub
☆25Dec 12, 2025Updated 7 months ago
tarod13 / laplacian_dual_dynamics
View on GitHub
Dual optimization to learn laplacian eigenpairs in arbitrary spaces
☆18Dec 18, 2024Updated last year
cosmicoptima / indranet-explorer
View on GitHub
Indranet Explorer, a simulated browser
☆16Nov 12, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Chillee / lit-llama
View on GitHub
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆10Aug 29, 2023Updated 2 years ago
SijiaCui / play-urts
View on GitHub
☆15Oct 28, 2024Updated last year
mklissa / dceo
View on GitHub
Learning diverse options through the Laplacian representation.
☆23Jan 5, 2024Updated 2 years ago
alacritty / termbenchbot
View on GitHub
Automated terminal emulator benchmarks
☆24Jun 22, 2026Updated last month
upiterbarg / hihack
View on GitHub
[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)
☆13Oct 30, 2023Updated 2 years ago
frinkleko / LIMIT-Sparse-Embedding
View on GitHub
Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…
☆16Sep 4, 2025Updated 10 months ago
spinbench / spinbench
View on GitHub
☆28May 30, 2026Updated last month
ahgamut / gcc
View on GitHub
☆17Apr 12, 2025Updated last year
rllabmcgill / rllabmcgill.github.io
View on GitHub
Production build of the new website
☆13May 19, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
mklissa / phi_gcn
View on GitHub
Reward Propagation using Graph Convolutional Networks
☆13Jun 19, 2021Updated 5 years ago
Lizn-zn / Nesy-Programming
View on GitHub
☆10Oct 28, 2024Updated last year
tokeron / DiffusionLens
View on GitHub
☆16Jan 30, 2025Updated last year
jeffreykegler / personal
View on GitHub
Jeffrey Kegler personal web page
☆13Jul 6, 2023Updated 3 years ago
yunfeixie233 / ViGaL
View on GitHub
☆70Feb 4, 2026Updated 5 months ago
vaguenebula / AlpacaDataReflect
View on GitHub
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12Mar 29, 2023Updated 3 years ago
haotiansun14 / BBox-Adapter
View on GitHub
Lightweight Adapting for Black-Box Large Language Models
☆26Feb 15, 2024Updated 2 years ago
Exant64 / CWE
View on GitHub
☆13Updated this week
yifan12wu / rl-laplacian
View on GitHub
Learning Laplacian Representations in Reinforcement Learning
☆18Jan 2, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jordancurve / gpt-vs-stockfish
View on GitHub
gpt-3.5-turbo-instruct, prompted with PGN, vs Stockfish Level 4 on LiChess
☆15Sep 19, 2023Updated 2 years ago
arcee-ai / DAM
View on GitHub
☆56Nov 6, 2024Updated last year
bkj / basenet
View on GitHub
Pytorch NN helpers
☆20May 3, 2024Updated 2 years ago
cvndsh / rebus
View on GitHub
REBUS: A Robust Evaluation Benchmark of Understanding Symbols
☆13Aug 13, 2024Updated last year
likenneth / q_probe
View on GitHub
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆40Jun 10, 2024Updated 2 years ago
camenduru / LGM-ply-to-glb-replicate
View on GitHub
☆16Feb 18, 2024Updated 2 years ago
nacloos / baba-is-ai
View on GitHub
Code for "Baba Is AI: Break the Rules to Beat the Benchmark"
☆49Sep 3, 2025Updated 10 months ago
hyintell / LLMSymbolic
View on GitHub
☆22Feb 29, 2024Updated 2 years ago
schwartz-lab-NLP / Tokens2Words
View on GitHub
☆16Apr 2, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
camenduru / daclip-uir-colab
View on GitHub
☆13Oct 12, 2023Updated 2 years ago
camenduru / Multi-LoRA-Composition-jupyter
View on GitHub
☆13Feb 28, 2024Updated 2 years ago
camenduru / champ-jupyter
View on GitHub
☆12Mar 25, 2024Updated 2 years ago
jxiw / MambaByte
View on GitHub
[CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model
☆27Oct 12, 2024Updated last year
camenduru / Mix-of-Show-colab
View on GitHub
☆13Dec 18, 2023Updated 2 years ago
camenduru / Open-Sora-jupyter
View on GitHub
☆12Mar 18, 2024Updated 2 years ago
Foaster-ai / Werewolf-bench
View on GitHub
☆33Aug 30, 2025Updated 10 months ago