adamkarvonen / train_ChessGPT
A repository for training nanoGPT-based chess-playing language models.
☆24Updated last year
Alternatives and similar repositories for train_ChessGPT
Users who are interested in train_ChessGPT are comparing it to the libraries listed below.
- A repo to evaluate various LLMs' chess-playing abilities.☆81Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆206Updated 7 months ago
- ☆38Updated 10 months ago
- rl from zero pretrain, can it be done? we'll see.☆49Updated this week
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- look how they massacred my boy☆63Updated 8 months ago
- Collection of LLM completions for reasoning-gym task datasets☆24Updated 3 weeks ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆52Updated 4 months ago
- explore token trajectory trees on instruct and base models☆126Updated 3 weeks ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 9 months ago
- Lego for GRPO☆28Updated 3 weeks ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated 2 months ago
- Train your own SOTA deductive reasoning model☆94Updated 3 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆80Updated 2 weeks ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆50Updated 4 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆63Updated 2 months ago
- ☆32Updated 11 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- ☆61Updated last year
- A framework for optimizing DSPy programs with RL☆75Updated this week
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆20Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆107Updated last year
- Simple repository for training small reasoning models☆31Updated 4 months ago
- ☆52Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆76Updated 10 months ago
- Example Agents for DIAMBRA Arena Environments☆17Updated 9 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 4 months ago