adamkarvonen / chess_llm_interpretability

Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.

☆216

Alternatives and similar repositories for chess_llm_interpretability

Users who are interested in chess_llm_interpretability are comparing it to the repositories listed below.

valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆375 Updated 9 months ago
Cerebras / gigaGPT
a small code base for training large models
☆309 Updated 5 months ago
adamkarvonen / chess_gpt_eval
A repo to evaluate various LLMs' chess-playing abilities.
☆83 Updated last year
drubinstein / pokemonred_puffer
☆170 Updated 3 months ago
adamkarvonen / train_ChessGPT
A repository for training nanoGPT-based chess-playing language models.
☆26 Updated last year
sgrvinod / chess-transformers
Teaching transformers to play chess
☆142 Updated 8 months ago
bclarkson-code / Tricycle
Autograd to GPT-2 completely from scratch
☆125 Updated 2 months ago
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆138 Updated last year
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆625 Updated 7 months ago
facebookresearch / searchformer
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
☆374 Updated last year
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆117 Updated last week
LAION-AI / AIW
Alice in Wonderland code base for experiments and raw experiments data
☆131 Updated last month
kagisearch / llm-chess-puzzles
Benchmark LLM reasoning capability by solving chess puzzles.
☆87 Updated 5 months ago
google-deepmind / searchless_chess
Grandmaster-Level Chess Without Search
☆593 Updated 9 months ago
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆350 Updated last year
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139 Updated last year
google-deepmind / recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
☆652 Updated 4 months ago
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆181 Updated last week
idoh / mamba.np
A pure NumPy implementation of Mamba.
☆223 Updated last year
lechmazur / elimination_game
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…
☆290 Updated 2 months ago
rgreenblatt / arc_draw_more_samples_pub
Draw more samples
☆194 Updated last year
mlecauchois / micrograd-cuda
☆248 Updated last year
llmonpy / needle-in-a-needlestack
☆116 Updated 8 months ago
valine / training-hot-swap
PyTorch script hot swap: change code without unloading your LLM from VRAM
☆124 Updated 6 months ago
namin / llm-verified-with-monte-carlo-tree-search
LLM verified with Monte Carlo Tree Search
☆280 Updated 6 months ago
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆286 Updated last month
Futrell / ziplm
☆254 Updated 2 years ago
RobertRiachi / nanoPALM
☆144 Updated 2 years ago
sumo43 / loopvlm
run paligemma in real time
☆133 Updated last year
EGjoni / DRUGS
Stop messing around with finicky sampling parameters and just use DRµGS!
☆357 Updated last year