kagisearch / llm-chess-puzzles
Benchmark LLM reasoning capability by solving chess puzzles.
☆75Updated 11 months ago
Alternatives and similar repositories for llm-chess-puzzles:
Users that are interested in llm-chess-puzzles are comparing it to the libraries listed below
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆204Updated 5 months ago
- A repo to evaluate various LLM's chess playing abilities.☆81Updated last year
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆253Updated 2 weeks ago
- Teaching transformers to play chess☆121Updated 2 months ago
- A repository for training nanogpt-based Chess playing language models.☆24Updated last year
- Grandmaster-Level Chess Without Search☆571Updated 3 months ago
- The history files when recording human interaction while solving ARC tasks☆106Updated last week
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆32Updated last month
- An implementation of bucketMul LLM inference☆216Updated 9 months ago
- Grow virtual creatures in static and physics simulated environments.☆52Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆170Updated this week
- LLMs playing chess are sensitive to how the position came to be☆22Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆45Updated 2 months ago
- Draw more samples☆189Updated 10 months ago
- This repository contain the simple llama3 implementation in pure jax.☆63Updated 2 months ago
- Visualize the intermediate output of Mistral 7B☆357Updated 3 months ago
- ☆89Updated last month
- LLM Chess - Large Language Models Competing in Chess☆39Updated this week
- Simple Transformer in Jax☆136Updated 10 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆112Updated this week
- A dictionary for https://neal.fun/infinite-craft/☆18Updated last year
- PageRank for LLMs☆41Updated 2 weeks ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆291Updated this week
- ☆106Updated 4 months ago
- Stop messing around with finicky sampling parameters and just use DRµGS!☆349Updated 10 months ago
- Array-Inspired Pipeline Language☆119Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ☆284Updated 2 weeks ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 5 months ago