adamkarvonen / train_ChessGPT
A repository for training nanoGPT-based chess-playing language models.
☆22 Updated 6 months ago
Related projects
Alternatives and complementary repositories for train_ChessGPT
- ☆18 Updated last year
- A repo to evaluate various LLMs' chess-playing abilities. ☆68 Updated 7 months ago
- ☆57 Updated 11 months ago
- Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search. ☆22 Updated last month
- ☆40 Updated last year
- Command-line script for running inference with models such as MPT-7B-Chat ☆102 Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆193 Updated this week
- ☆16 Updated last month
- Simplex Random Feature attention, in PyTorch ☆71 Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆152 Updated this week
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆21 Updated 5 months ago
- look how they massacred my boy ☆58 Updated last month
- Comprehensive analysis of the performance differences among QLoRA, LoRA, and full fine-tunes. ☆81 Updated last year
- Chat Markup Language conversation library ☆54 Updated 10 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit. ☆108 Updated 2 months ago
- A JAX-like function transformation engine, but micro: microjax ☆26 Updated 3 weeks ago
- Port of Andrej Karpathy's nanoGPT to Apple's MLX framework. ☆98 Updated 9 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs ☆57 Updated 4 months ago
- Navigating a maze using an LLM agent ☆34 Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT-4. ☆72 Updated 10 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 2 months ago
- ☆36 Updated 3 months ago
- A framework for orchestrating AI agents using a mermaid graph ☆74 Updated 6 months ago
- ☆104 Updated 8 months ago
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 Updated 11 months ago
- ☆48 Updated last year
- ☆27 Updated 4 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆60 Updated 6 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated 10 months ago
- ☆35 Updated 4 months ago