sgrvinod / chess-transformers
Teaching transformers to play chess
☆120 · Updated 2 months ago
Alternatives and similar repositories for chess-transformers:
Users interested in chess-transformers often compare it to the repositories listed below.
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆204 · Updated 4 months ago
- Grandmaster-Level Chess Without Search ☆567 · Updated 3 months ago
- ☆134 · Updated last week
- A pure NumPy implementation of Mamba. ☆222 · Updated 9 months ago
- Autograd to GPT-2 completely from scratch ☆112 · Updated this week
- Implementation of a snake game based on a diffusion model ☆90 · Updated 3 months ago
- Helpers and such for working with Lambda Cloud ☆51 · Updated last year
- History files recording human interaction while solving ARC tasks ☆106 · Updated this week
- Highly commented implementations of Transformers in PyTorch ☆135 · Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆606 · Updated 3 weeks ago
- A repository for training nanogpt-based chess-playing language models. ☆24 · Updated 11 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆198 · Updated 11 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co… ☆246 · Updated last week
- ☆242 · Updated last year
- Alice in Wonderland code base for experiments and raw experiment data ☆129 · Updated 2 months ago
- Simplified implementation of a UMAP-like dimensionality-reduction algorithm ☆48 · Updated 4 months ago
- Benchmark LLM reasoning capability by solving chess puzzles. ☆73 · Updated 10 months ago
- Documented and unit-tested educational deep learning framework with autograd from scratch. ☆111 · Updated last year
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes. ☆82 · Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆169 · Updated last week
- Simple Transformer in Jax ☆136 · Updated 9 months ago
- Full finetuning of large language models without large memory requirements ☆94 · Updated last year
- Run PaliGemma in real time ☆131 · Updated 10 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only NumPy (<650 lines). ☆249 · Updated last year
- ☆27 · Updated 9 months ago
- ☆150 · Updated 8 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers ☆60 · Updated last week
- My writings about ARC (Abstraction and Reasoning Corpus) ☆76 · Updated last week
- Mistral7B playing DOOM ☆130 · Updated 9 months ago
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆95 · Updated last month