kagisearch / llm-chess-puzzles
Benchmark LLM reasoning capability by solving chess puzzles.
☆90 · Apr 26, 2025 · Updated 9 months ago
Alternatives and similar repositories for llm-chess-puzzles
Users who are interested in llm-chess-puzzles compare it to the repositories listed below:
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression." ☆18 · Dec 13, 2024 · Updated last year
- Clustered Compositional Embeddings ☆11 · Oct 25, 2023 · Updated 2 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code ☆10 · Aug 29, 2023 · Updated 2 years ago
- ☆10 · Oct 28, 2024 · Updated last year
- Automated terminal emulator benchmarks ☆22 · Jan 14, 2026 · Updated last month
- Indranet Explorer, a simulated browser ☆16 · Nov 12, 2024 · Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from Google's paper `On the Theoretica… ☆15 · Sep 4, 2025 · Updated 5 months ago
- an opinionated Wayland clipboard manager ☆14 · Oct 24, 2025 · Updated 3 months ago
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces ☆16 · Dec 18, 2024 · Updated last year
- ☆15 · Apr 2, 2025 · Updated 10 months ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference ☆30 · Mar 14, 2024 · Updated last year
- ☆18 · Jan 3, 2025 · Updated last year
- The official implementation of the DAC 2024 paper GQA-LUT ☆20 · Dec 20, 2024 · Updated last year
- 📚 Build knowledge bases for RAG ☆31 · Jul 3, 2025 · Updated 7 months ago
- Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Docu… ☆23 · Dec 21, 2024 · Updated last year
- AI web parser library + CLI ☆48 · May 5, 2025 · Updated 9 months ago
- ☆24 · Dec 12, 2025 · Updated 2 months ago
- ☆22 · Feb 29, 2024 · Updated last year
- gpt completions in vscode ☆35 · Mar 24, 2023 · Updated 2 years ago
- HTML Cleaner Add-on for Anki ☆23 · Jun 25, 2020 · Updated 5 years ago
- ☆56 · Nov 6, 2024 · Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆26 · Updated this week
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆20 · Oct 24, 2025 · Updated 3 months ago
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model ☆24 · Oct 12, 2024 · Updated last year
- Learning diverse options through the Laplacian representation. ☆23 · Jan 5, 2024 · Updated 2 years ago
- Testing LLM reasoning abilities with lineage relationship quizzes. ☆35 · Feb 2, 2026 · Updated last week
- A simple lightweight Model Context Protocol (MCP) server integration framework ☆17 · Jan 23, 2026 · Updated 3 weeks ago
- ☆61 · Apr 22, 2024 · Updated last year
- Official Pytorch implementations for "Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation"(EC… ☆33 · Mar 15, 2025 · Updated 11 months ago
- ☆27 · Aug 30, 2023 · Updated 2 years ago
- LLM Chess - evaluating Large Language Models' reasoning and instruction-following abilities by simulating chess games ☆91 · Updated this week
- Code for "Baba Is AI: Break the Rules to Beat the Benchmark" ☆41 · Sep 3, 2025 · Updated 5 months ago
- AuraMatrix is a personality analysis web app that uses an LLM for evaluation. I made this for Gyanotsav-2025 to show different ways to ut… ☆11 · Dec 22, 2025 · Updated last month
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. ☆11 · Aug 30, 2024 · Updated last year
- Minecraft mob and difficulty overhaul for Fabric. ☆11 · Feb 28, 2023 · Updated 2 years ago
- Experimental sampler to make LLMs more creative ☆31 · Aug 2, 2023 · Updated 2 years ago
- Auditing agents for fine-tuning safety ☆18 · Oct 21, 2025 · Updated 3 months ago
- Distill thinking dataset more compactly and accurately! ☆37 · Jun 6, 2025 · Updated 8 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ☆72 · Nov 24, 2024 · Updated last year