adamkarvonen / train_ChessGPT
A repository for training nanoGPT-based chess-playing language models.
☆22 · Updated 4 months ago
Related projects:
- A repo to evaluate various LLMs' chess-playing abilities. ☆62 · Updated 5 months ago
- ☆18 · Updated 11 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆143 · Updated this week
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆190 · Updated 3 months ago
- ☆55 · Updated 9 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆42 · Updated last year
- ☆48 · Updated 11 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆117 · Updated 8 months ago
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆47 · Updated last year
- Simplex Random Feature attention, in PyTorch ☆71 · Updated 11 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 · Updated 2 weeks ago
- Inference code for mixtral-8x7b-32kseqlen ☆97 · Updated 9 months ago
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes. ☆81 · Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆60 · Updated 4 months ago
- Full fine-tuning of large language models without large memory requirements ☆94 · Updated 8 months ago
- Chat Markup Language conversation library ☆53 · Updated 8 months ago
- ☆89 · Updated 11 months ago
- History files recorded from human interaction while solving ARC tasks ☆91 · Updated this week
- A JAX-like function transformation engine, but micro: microjax ☆24 · Updated 3 weeks ago
- Run PaliGemma in real time ☆122 · Updated 4 months ago
- ☆101 · Updated 6 months ago
- ☆68 · Updated 2 months ago
- Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search. ☆18 · Updated 2 months ago
- ☆36 · Updated 6 months ago
- Sparse autoencoders for Contra text embedding models ☆24 · Updated 4 months ago
- ☆22 · Updated last year
- A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition. ☆69 · Updated 2 months ago
- ☆50 · Updated 4 months ago
- Command-line script for running inference on models such as MPT-7B-Chat ☆102 · Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆49 · Updated 3 weeks ago