adamkarvonen / train_ChessGPT
A repository for training nanogpt-based Chess playing language models.
☆24Updated last year
Alternatives and similar repositories for train_ChessGPT:
Users that are interested in train_ChessGPT are comparing it to the libraries listed below
- A repo to evaluate various LLM's chess playing abilities.☆81Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆204Updated 5 months ago
- ☆20Updated last year
- ☆38Updated 9 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- look how they massacred my boy☆63Updated 6 months ago
- Simple GRPO scripts and configurations.☆58Updated 3 months ago
- Lego for GRPO☆27Updated last month
- Cerule - A Tiny Mighty Vision Model☆67Updated 8 months ago
- ☆61Updated last year
- Train your own SOTA deductive reasoning model☆91Updated 2 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆171Updated this week
- ☆27Updated 10 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆45Updated 2 months ago
- Collection of LLM completions for reasoning-gym task datasets☆19Updated this week
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆15Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆73Updated last week
- ☆48Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆41Updated 3 weeks ago
- Simplex Random Feature attention, in PyTorch☆74Updated last year
- ☆113Updated 4 months ago
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 3 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆64Updated 2 weeks ago
- realtime latent world model inference demo☆45Updated 5 months ago
- ☆84Updated last week
- A reading list of relevant papers and projects on foundation model annotation☆27Updated 2 months ago
- prime-rl is a codebase for decentralized RL training at scale☆89Updated this week