adamkarvonen / train_ChessGPTLinks
A repository for training nanogpt-based Chess playing language models.
☆25Updated last year
Alternatives and similar repositories for train_ChessGPT
Users that are interested in train_ChessGPT are comparing it to the libraries listed below
Sorting:
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆208Updated 8 months ago
- A repo to evaluate various LLM's chess playing abilities.☆82Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆58Updated 5 months ago
- Train your own SOTA deductive reasoning model☆101Updated 4 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆177Updated 2 weeks ago
- look how they massacred my boy☆63Updated 9 months ago
- explore token trajectory trees on instruct and base models☆134Updated 2 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 5 months ago
- Draw more samples☆193Updated last year
- Simple GRPO scripts and configurations.☆59Updated 5 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆190Updated this week
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 5 months ago
- run paligemma in real time☆131Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆129Updated last year
- ☆61Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 6 months ago
- ☆38Updated last year
- ☆88Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆93Updated this week
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- a curated list of data for reasoning ai☆137Updated 11 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆64Updated 9 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 9 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆103Updated 4 months ago
- ☆154Updated 3 weeks ago
- ☆94Updated this week
- ☆116Updated 6 months ago
- Lego for GRPO☆28Updated 2 months ago