dpaleka / llm-chess-proofgameLinks
LLMs playing chess are sensitive to how the position came to be
☆23Updated last year
Alternatives and similar repositories for llm-chess-proofgame
Users that are interested in llm-chess-proofgame are comparing it to the libraries listed below
Sorting:
- Visual Transformer Mechanistic Analysis Tool☆33Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆205Updated 6 months ago
- Praetor is a lightweight finetuning data and prompt management tool☆67Updated 6 months ago
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean …☆62Updated 2 months ago
- A star for organising blocks and playing with transformers.☆23Updated last year
- Pivotal Token Search☆89Updated 2 weeks ago
- Benchmark LLM reasoning capability by solving chess puzzles.☆80Updated last month
- ☆27Updated 8 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated 9 months ago
- Grow virtual creatures in static and physics simulated environments.☆52Updated last year
- A Full Transcript of the Lighthill Debate on AI from 1973, with Introductory Remarks☆30Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- Visualize text embeddings☆40Updated last year
- ☆93Updated 4 months ago
- When AI Fails is a project dedicated to documenting the funny, interesting, and sometimes outright stupid ways in which AI can fail.☆61Updated last month
- Interactive Fiction in the Age of AI☆29Updated last week
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- A playground to make it easy to try crazy things☆33Updated last month
- Automated Capability Discovery via Foundation Model Self-Exploration☆47Updated 3 months ago
- ☆28Updated last year
- A repository for training nanogpt-based Chess playing language models.☆24Updated last year
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated last year
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 8 months ago
- a curated list of data for reasoning ai☆136Updated 9 months ago
- ☆36Updated 2 years ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆19Updated 2 months ago
- LLM plugin for clustering embeddings☆76Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated 10 months ago
- Benchmark that evaluates LLMs using 651 NYT Connections puzzles extended with extra trick words☆93Updated this week
- A repo to evaluate various LLM's chess playing abilities.☆80Updated last year