adamkarvonen / chess_llm_interpretabilityView external linksLinks
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.
☆218Nov 18, 2024Updated last year
Alternatives and similar repositories for chess_llm_interpretability
Users that are interested in chess_llm_interpretability are comparing it to the libraries listed below
Sorting:
- A repository for training nanogpt-based Chess playing language models.☆26Apr 25, 2024Updated last year
- A repo to evaluate various LLM's chess playing abilities.☆87Apr 12, 2024Updated last year
- ☆43Jan 24, 2024Updated 2 years ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Feb 12, 2025Updated last year
- Mistral7B playing DOOM☆29Mar 27, 2024Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Aug 20, 2024Updated last year
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆131Oct 26, 2023Updated 2 years ago
- NSA Triton Kernels written with GPT5 and Opus 4.1☆70Aug 12, 2025Updated 6 months ago
- Sparsify transformers with SAEs and transcoders☆692Updated this week
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- Grandmaster-Level Chess Without Search☆606Jan 10, 2025Updated last year
- ☆11Jun 17, 2024Updated last year
- Dust players once they die☆11Feb 17, 2020Updated 5 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆11Dec 11, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- ASMLE - Bootable Wordle in 512 bytes!☆39Mar 11, 2022Updated 3 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆11Jan 19, 2024Updated 2 years ago
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- I learn about and explain quantization☆26Apr 19, 2024Updated last year
- ☆207Oct 14, 2025Updated 4 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 5 months ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆201Jul 12, 2023Updated 2 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated last year
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- ☆10Oct 28, 2024Updated last year
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- StyleGAN Explorer in Colab + JS GUI☆26Jan 29, 2020Updated 6 years ago
- Inter-process communication made simple.☆16Feb 14, 2025Updated last year
- ☆255Jul 15, 2023Updated 2 years ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Apr 21, 2025Updated 9 months ago
- NanoGPT (124M) in 2 minutes☆4,624Updated this week
- Evaluating the Mamba architecture on the Othello game☆49Apr 25, 2024Updated last year
- Interpreting how transformers simulate agents performing RL tasks☆90Oct 23, 2023Updated 2 years ago
- ☆56Nov 6, 2024Updated last year
- ☆82Apr 16, 2024Updated last year
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆240Dec 16, 2024Updated last year