nathan-barry / tiny-diffusionLinks
A character-level language diffusion model trained on Tiny Shakespeare
☆849Updated 3 weeks ago
Alternatives and similar repositories for tiny-diffusion
Users that are interested in tiny-diffusion are comparing it to the libraries listed below
Sorting:
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 9 months ago
- ☆465Updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆938Updated 7 months ago
- Live-bending a foundation model’s output at neural network level.☆273Updated 9 months ago
- ☆258Updated 11 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆691Updated 7 months ago
- ~950 line, minimal, extensible LLM inference engine built from scratch.☆405Updated 3 weeks ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆347Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆218Updated last year
- ☆179Updated 2 months ago
- ☆214Updated last week
- noise_step: Training in 1.58b With No Gradient Memory☆220Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated 2 months ago
- Open-source release accompanying Gao et al. 2025☆501Updated last month
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- Implementation snake game based on Diffusion model☆93Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆628Updated 10 months ago
- rl from zero pretrain, can it be done? yes.☆286Updated 4 months ago
- explore token trajectory trees on instruct and base models☆150Updated 8 months ago
- Getting crystal-like representations with harmonic loss☆195Updated 10 months ago
- Code for the Fractured Entangled Representation Hypothesis position paper!☆221Updated 3 months ago
- Build your own visual reasoning model☆418Updated 3 weeks ago
- dLLM: Simple Diffusion Language Modeling☆1,693Updated last month
- ☆541Updated 6 months ago
- Flux 2 image generation model pure C inference☆1,632Updated this week
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 9 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆297Updated 3 weeks ago
- Stupid test to check whether MDL principles improve ARC performance☆76Updated this week
- Mistral7B playing DOOM☆139Updated last year
- High-Performance Implementation of OpenAI's TikToken.☆467Updated 7 months ago