sayakpaul / nanoDiTLinks

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

☆120

Alternatives and similar repositories for nanoDiT

Users that are interested in nanoDiT are comparing it to the libraries listed below

Sorting:

fal-ai / diffusion-speedrun
Focused on fast experimentation and simplicity
☆76Updated 7 months ago
bluorion-com / ZClip
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
☆131Updated last month
lucidrains / maskbit-pytorch
Implementation of the proposed MaskBit from Bytedance AI
☆82Updated 8 months ago
SwayStar123 / microdiffusion
☆47Updated 5 months ago
cloneofsimo / repa-rf
☆32Updated 9 months ago
lucidrains / spline-based-transformer
Implementation of the proposed Spline-Based Transformer from Disney Research
☆102Updated 8 months ago
lucidrains / titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
☆176Updated last year
lucidrains / hyper-connections
Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public
☆88Updated last month
lucidrains / hl-gauss-pytorch
The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch
☆65Updated 2 months ago
lucidrains / multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
☆102Updated last year
lucidrains / adam-atan2-pytorch
Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch
☆112Updated 8 months ago
cloneofsimo / min-max-in-dit
☆27Updated last year
lucidrains / h-net-dynamic-chunking
Implementation of the dynamic chunking mechanism in H-net by Hwang et al. of Carnegie Mellon
☆63Updated this week
AnonymousAlethiometer / SGD_SaI
Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
☆52Updated 6 months ago
cloneofsimo / efae
☆23Updated last year
cloneofsimo / vqgan-training
Train VAE like a boss
☆287Updated 9 months ago
huggingface / flux-fast
Making Flux go brrr on GPUs.
☆124Updated 2 weeks ago
deepreinforce-ai / CUDA-L1
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
☆131Updated this week
sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆52Updated 8 months ago
lucidrains / pytorch-custom-utils
Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…
☆124Updated last year
OliverRensu / MVAR
☆70Updated 8 months ago
lucidrains / genie2-pytorch
Implementation of a framework for Genie2 in Pytorch
☆149Updated 6 months ago
huggingface / lora-fast
Minimal repository to demonstrate fast LoRA inference with Flux family of models.
☆20Updated 2 weeks ago
lucidrains / deep-cross-attention
Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch
☆90Updated 5 months ago
cloneofsimo / minDinoV2
☆21Updated 9 months ago
fal-ai-community / nano-mdm
Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun
☆55Updated 4 months ago
SwayStar123 / reimei
☆24Updated 3 months ago
NVlabs / GSPN
[CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network
☆104Updated 2 weeks ago
lucidrains / vit-arc-slot
Explorations into improving ViTArc with Slot Attention
☆42Updated 9 months ago
cloneofsimo / infinite-fractal-stream
☆30Updated 10 months ago