A character-level language diffusion model trained on Tiny Shakespeare
☆904Jan 16, 2026Updated 3 months ago
Alternatives and similar repositories for tiny-diffusion
Users that are interested in tiny-diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 11 months ago
- Research repository to the publication: Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molec…☆14Apr 2, 2024Updated 2 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 4 months ago
- Code for the paper "Function-Space Learning Rates"☆24Jun 3, 2025Updated 11 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Sep 17, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last week
- Everything about the SmolLM and SmolVLM family of models☆3,755Apr 2, 2026Updated last month
- ☆12Dec 14, 2024Updated last year
- Plugin Marketplace for Claude Code☆20Feb 8, 2026Updated 2 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆702Jun 14, 2025Updated 10 months ago
- A neurosymbolic perspective on LLMs☆1,713Apr 27, 2026Updated last week
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated last year
- NanoGPT (124M) in 90 seconds☆5,200Updated this week
- Investigation into replacing the MES compiler☆30Apr 29, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Diffusion on syntax trees for program synthesis☆485Jun 27, 2024Updated last year
- Dream 7B, a large diffusion language model☆1,235Nov 21, 2025Updated 5 months ago
- Moved to Scottcjn/legend-of-elya-n64 (consolidated)☆29Mar 21, 2026Updated last month
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 7 months ago
- ☆198May 5, 2025Updated last year
- ☆14Jun 25, 2022Updated 3 years ago
- StyleGAN2 - Official TensorFlow Implementation with practical improvements☆11Apr 17, 2020Updated 6 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆25Dec 19, 2025Updated 4 months ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆12Mar 7, 2024Updated 2 years ago
- Triplestore wrapper for HTML5 WebStorage☆22Dec 23, 2015Updated 10 years ago
- A text compressor based on the PAQ architecture.☆22Sep 12, 2025Updated 7 months ago
- EDM-TR9 is an open audio dataset containing a series of TR-909 drum rhythms.☆16Dec 7, 2023Updated 2 years ago
- Deep learning AI for generating new molecules that bond to the COVID-19.☆12Sep 17, 2020Updated 5 years ago
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Apr 10, 2026Updated 3 weeks ago
- A `tree` util enhanced with tokens, lines, and components. `pip install -U tree_plus`☆15Nov 24, 2025Updated 5 months ago
- ☆57Dec 27, 2025Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,171Aug 26, 2025Updated 8 months ago
- Production focused Self-harnessed LM runtime (RLM) that allows the LM to call its sub-lm with DSPy signatures. Define your inputs, output…☆267Updated this week
- A curated list of tools, guides and resources for the Replicate AI model platform☆17Jan 10, 2024Updated 2 years ago
- ☆11May 16, 2025Updated 11 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆1,261Jun 8, 2025Updated 10 months ago
- ☆16Feb 28, 2025Updated last year
- A repository for research on medium sized language models.☆78May 23, 2024Updated last year