adamkarvonen / train_ChessGPTLinks
A repository for training nanogpt-based Chess playing language models.
☆26Updated last year
Alternatives and similar repositories for train_ChessGPT
Users that are interested in train_ChessGPT are comparing it to the libraries listed below
Sorting:
- A repo to evaluate various LLM's chess playing abilities.☆86Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆218Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 10 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated 2 years ago
- smolLM with Entropix sampler on pytorch☆149Updated last year
- ☆125Updated last year
- Sphynx Hallucination Induction☆53Updated 11 months ago
- look how they massacred my boy☆63Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Updated 10 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆183Updated 2 months ago
- ☆45Updated 2 years ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated this week
- run paligemma in real time☆133Updated last year
- ☆62Updated 2 years ago
- Simple Transformer in Jax☆140Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated 3 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆32Updated last year
- Draw more samples☆198Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆116Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT4.☆71Updated 2 years ago
- Simple GRPO scripts and configurations.☆59Updated 11 months ago
- A visual interface for understanding and interpreting Transformers☆77Updated 2 years ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Updated last year
- Simplex Random Feature attention, in PyTorch☆75Updated 2 years ago
- explore token trajectory trees on instruct and base models☆150Updated 7 months ago
- smol models are fun too☆93Updated last year
- Mistral7B playing DOOM☆138Updated last year
- A Collection of Pydantic Models to Abstract IRL☆36Updated 3 weeks ago