nathan-barry / tiny-diffusionLinks
A character-level language diffusion model trained on Tiny Shakespeare
☆594Updated 3 weeks ago
Alternatives and similar repositories for tiny-diffusion
Users that are interested in tiny-diffusion are comparing it to the libraries listed below
Sorting:
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 7 months ago
- Live-bending a foundation model’s output at neural network level.☆271Updated 8 months ago
- Heirarchical Navigable Small Worlds☆101Updated 4 months ago
- ☆245Updated 9 months ago
- ☆199Updated 7 months ago
- High-Performance Implementation of OpenAI's TikToken.☆464Updated 5 months ago
- ☆458Updated 2 weeks ago
- Pivotal Token Search☆132Updated last week
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆293Updated 3 months ago
- ☆164Updated 8 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆687Updated 5 months ago
- A tiny autograd engine with a Jax-like API☆74Updated 5 months ago
- ☆249Updated last year
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization☆343Updated last month
- ☆47Updated 8 months ago
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆362Updated 6 months ago
- explore token trajectory trees on instruct and base models☆149Updated 6 months ago
- ☆176Updated last week
- Mistral7B playing DOOM☆138Updated last year
- Autograd to GPT-2 completely from scratch☆125Updated 4 months ago
- Proof of thought : LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL)☆360Updated last month
- An implementation of bucketMul LLM inference☆223Updated last year
- Parallel thinking for LLMs. Confidence‑gated, strategy‑driven, offline‑friendly☆259Updated 2 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆628Updated 8 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆219Updated last year
- A playground to make it easy to try crazy things☆33Updated 2 weeks ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆221Updated 5 months ago
- Multimodal RAG to search and interact locally with technical documents of any kind☆279Updated last month
- See Through Your Models☆402Updated 5 months ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆164Updated last week