sgrvinod / chess-transformers
Teaching transformers to play chess
☆141 Updated 9 months ago
Alternatives and similar repositories for chess-transformers
Users who are interested in chess-transformers are comparing it to the libraries listed below.
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆216 Updated 11 months ago
- Grandmaster-Level Chess Without Search ☆594 Updated 9 months ago
- A pure NumPy implementation of Mamba. ☆223 Updated last year
- A tool to analyze and debug neural networks in PyTorch. Use a GUI to traverse the computation graph and view the data from many different… ☆291 Updated 10 months ago
- Autograd to GPT-2 completely from scratch ☆125 Updated 2 months ago
- Implementation of the Snake game based on a diffusion model ☆91 Updated 9 months ago
- PyTorch script hot swap: Change code without unloading your LLM from VRAM ☆124 Updated 6 months ago
- ☆170 Updated 3 months ago
- Run PaliGemma in real time ☆133 Updated last year
- Code for the Fractured Entangled Representation Hypothesis position paper! ☆203 Updated 5 months ago
- 🚀 JIT Implementation: Code That Writes Itself ☆115 Updated last year
- ☆248 Updated last year
- History files recorded from human interaction while solving ARC tasks ☆117 Updated last week
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi… ☆350 Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆625 Updated 7 months ago
- Gradient descent is cool and all, but what if we could delete it? ☆104 Updated 2 months ago
- Visualize the intermediate output of Mistral 7B ☆375 Updated 9 months ago
- Helpers and such for working with Lambda Cloud ☆51 Updated last year
- Documented and unit-tested educational deep learning framework with autograd from scratch. ☆122 Updated last year
- Next Generation Experimental Tracking for Machine Learning Operations ☆348 Updated 5 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆44 Updated 2 years ago
- A repo to evaluate various LLMs' chess-playing abilities. ☆83 Updated last year
- A small code base for training large models ☆310 Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆325 Updated last year
- Alice in Wonderland code base for experiments and raw experiment data ☆131 Updated last month
- ☆138 Updated last year
- Various handy scripts to quickly set up new Linux and Windows sandboxes, containers and WSL. ☆40 Updated last week
- NanoGPT-speedrunning for the poor T4 enjoyers ☆72 Updated 6 months ago
- A curated list of data for reasoning AI ☆140 Updated last year
- This repository contains a simple Llama 3 implementation in pure JAX. ☆70 Updated 8 months ago