ash80/diffusion-gpt

ash80 / diffusion-gpt

From baby GPT to diffusion GPT: An annotated implementation of a character-level discrete diffusion model (adapted from Karpathy’s baby GPT).

☆258

Alternatives and similar repositories for diffusion-gpt

Users who are interested in diffusion-gpt are comparing it to the repositories listed below.

McGill-NLP / the-markovian-thinker
Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
☆349 · Mar 16, 2026 · Updated 4 months ago
OpenGVLab / NaViL
☆94 · Oct 10, 2025 · Updated 9 months ago
Arseny5 / InternVL-3.5-QLoRA-Fine-tune
Fine-tuning InternVL-3.5-1B with different PEFT strategies: LoRA/QLoRA/full-tuning on MVBench dataset
☆21 · Oct 14, 2025 · Updated 9 months ago
simonw / sqlite-s3vfs
Recovered from https://archive.softwareheritage.org/browse/origin/directory/?origin_url=https://github.com/uktrade/sqlite-s3vfs
☆40 · Dec 30, 2025 · Updated 6 months ago
cloneofsimo / minVJEPA
☆25 · May 23, 2025 · Updated last year
RohanAdwankar / cgpu
CLI enabling free cloud GPU access in your terminal for learning CUDA.
☆142 · Nov 30, 2025 · Updated 7 months ago
Naresh-Khatri / Coding-Ducks
☆31 · Jun 14, 2026 · Updated last month
inclusionAI / dInfer
dInfer: An Efficient Inference Framework for Diffusion Language Models
☆476 · Feb 11, 2026 · Updated 5 months ago
daydreamsai / lucid-agents
Lucid Agents Commerce SDK. Bootstrap AI agents in 60 seconds that can pay, sell, and participate in agentic commerce supply chains. Our p…
☆189 · Updated this week
CopilotKit / deep-agents-job-search-assistant
Full-stack job search assistant using CopilotKit & LangChain Deep Agents: upload a resume, extract skills, run sub-agents to search the w…
☆17 · Mar 12, 2026 · Updated 4 months ago
Apollo-Level2-Web-Dev / B6A1
☆31 · Nov 17, 2025 · Updated 8 months ago
s-sahoo / Eso-LMs
[ICML 2026] Esoteric Language Models
☆121 · Jul 13, 2026 · Updated last week
thu-coai / Glyph
Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"
☆592 · Nov 4, 2025 · Updated 8 months ago
code-423n4 / 2025-10-hybra-finance
☆37 · Jan 16, 2026 · Updated 6 months ago
titogabus / ComfyUI-InstaSwap
☆17 · Apr 23, 2024 · Updated 2 years ago
nathanrs / tiny-diffusion
A character-level language diffusion model trained on Tiny Shakespeare
☆922 · Jan 16, 2026 · Updated 6 months ago
nisaruj / comfyui-daam
ComfyUI custom nodes for Diffusion Attentive Attribution Maps (DAAM)
☆51 · Oct 13, 2025 · Updated 9 months ago
SamsungSAILMontreal / TinyRecursiveModels
☆6,573 · Apr 1, 2026 · Updated 3 months ago
artcore-c / AI-Voice-Clone-with-Qwen3-TTS
Free voice cloning and TTS for creators using Qwen3-TTS on Google Colab. Clone your voice with just a few seconds of audio. Complete guid…
☆16 · Feb 10, 2026 · Updated 5 months ago
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆400 · May 31, 2025 · Updated last year
K-Dense-AI / karpathy
An agentic Machine Learning Engineer
☆1,511 · May 29, 2026 · Updated last month
bgreene2 / ComfyUI-Hunyuan-Image-3
A ComfyUI custom node that provides a basic implementation of Hunyuan Image 3.0 using Transformers.
☆33 · Oct 15, 2025 · Updated 9 months ago
farukalpay / xml-lib
XML-Lib is an over-engineered XML playground with a canonical lifecycle, a guardrail subsystem, Hilbert-backed proofs, and PPTX workflow …
☆19 · Nov 17, 2025 · Updated 8 months ago
QasimWani / spatial-reasoning
Reasoning Systems with tool use are strong zero-shot object detectors
☆70 · Oct 9, 2025 · Updated 9 months ago
tuanlda78202 / gpt-oss-amd
implement GPT-OSS 20B & 120B C++ inference from scratch on AMD GPUs
☆174 · Oct 25, 2025 · Updated 8 months ago
MoonshotAI / checkpoint-engine
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆970 · Jul 4, 2026 · Updated 2 weeks ago
pcx-wave / skill-router
A meta-skill for Claude Code that routes requests to the right installed skill — suggest mode or silent auto-routing
☆23 · Jul 4, 2026 · Updated 2 weeks ago
itanishqshelar / SmartRAG
SmartRAG is a privacy-first multimodal RAG system that lets you chat intelligently with your documents, images, and audio. Upload PDFs, W…
☆110 · Apr 6, 2026 · Updated 3 months ago
christian-saldana / ComfyViewer
☆50 · Oct 1, 2025 · Updated 9 months ago
avbiswas / fast-rlm
A feature rich implementation of Recursive Language Models, with ACP integration, REPL tool support, structured IO, advanced visualizatio…
☆450 · Jul 7, 2026 · Updated 2 weeks ago
HarleyCoops / smolThinker-.5B
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14 · Updated this week
Aratako / CALM-DACVAE
An attempt to reproduce CALM (Continuous Audio Language Models) using DACVAE as the audio VAE.
☆17 · Feb 20, 2026 · Updated 5 months ago
vivekkalyanarangan30 / llm_from_scratch
☆1,073 · Oct 23, 2025 · Updated 8 months ago
FoundationAgents / ReCode
Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.
☆561 · Apr 21, 2026 · Updated 3 months ago
espresso3389 / MioTTS-llama.cpp
A fast, lightweight text-to-speech tool that runs entirely on your CPU. Give it text, pick a voice, and get a WAV file out.
☆67 · Feb 22, 2026 · Updated 4 months ago
comfyuiAudio / ComfyAudio
ComfyUI for Audio
☆43 · Sep 21, 2025 · Updated 10 months ago
microsoft / dion
Dion optimizer algorithm
☆494 · Jul 12, 2026 · Updated last week
kaist-cvml / geometric-distillation
[EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation
☆39 · Jun 12, 2025 · Updated last year
Paulescu / image-classification-with-local-vlms
Learn to build and deploy local Visual Language Models for Edge AI
☆375 · Oct 30, 2025 · Updated 8 months ago