adamkarvonen / train_ChessGPT

A repository for training nanoGPT-based chess-playing language models.

☆26

Alternatives and similar repositories for train_ChessGPT

Users who are interested in train_ChessGPT are comparing it to the repositories listed below.

adamkarvonen / chess_gpt_eval
A repo to evaluate various LLMs' chess-playing abilities.
☆83 Updated last year
adamkarvonen / chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …
☆216 Updated 11 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆109 Updated 7 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆118 Updated last year
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆181 Updated 2 weeks ago
xjdr-alt / simple_transformer
Simple Transformer in JAX
☆139 Updated last year
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆84 Updated 2 months ago
sumo43 / loopvlm
run paligemma in real time
☆133 Updated last year
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆138 Updated last year
gkamradt / SnakeBench
☆93 Updated 4 months ago
vgel / logitloom
explore token trajectory trees on instruct and base models
☆148 Updated 5 months ago
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆113 Updated last year
yizhe-ang / interactive-transformer
A visual interface for understanding and interpreting Transformers
☆77 Updated 2 years ago
xjdr-alt / muzero_sketch
☆40 Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆107 Updated 7 months ago
conglu1997 / ACD
Automated Capability Discovery via Foundation Model Self-Exploration
☆65 Updated 8 months ago
lechmazur / nyt-connections
Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words
☆155 Updated 2 weeks ago
SinatrasC / entropix-smollm
SmolLM with Entropix sampler on PyTorch
☆150 Updated 11 months ago
jxmorris12 / gptzip
Losslessly encode text natively with arithmetic coding and HuggingFace Transformers
☆76 Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆145 Updated 8 months ago
NousResearch / StripedHyenaTrainer
☆61 Updated last year
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆172 Updated 9 months ago
normal-computing / extended-mind-transformers
☆123 Updated last year
Aleph-Alpha-Research / scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆64 Updated 3 weeks ago
LucasSte / MLX-vs-Pytorch
Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs
☆89 Updated last year
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contains a simple Llama 3 implementation in pure JAX.
☆70 Updated 8 months ago
lechmazur / elimination_game
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…
☆290 Updated 2 months ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆117 Updated last week
drubinstein / pokemonred_puffer
☆170 Updated 3 months ago