sayakpaul / nanoDiTLinks
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
☆141Updated 7 months ago
Alternatives and similar repositories for nanoDiT
Users that are interested in nanoDiT are comparing it to the libraries listed below
Sorting:
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".☆141Updated last month
- Focused on fast experimentation and simplicity☆76Updated last year
- Implementation of the proposed MaskBit from Bytedance AI☆83Updated last year
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆183Updated last year
- ☆48Updated 10 months ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆70Updated last month
- Implementation of a multimodal diffusion transformer in Pytorch☆107Updated last year
- Train VAE like a boss☆311Updated last year
- ☆27Updated last year
- Flash Attention Triton kernel with support for second-order derivatives☆125Updated last week
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆94Updated 6 months ago
- ☆32Updated last year
- Official PyTorch Implementation of "Flow Map Distillation Without Data"☆103Updated last month
- Implementation of the proposed Spline-Based Transformer from Disney Research☆105Updated last year
- Implementation of the dynamic chunking mechanism in H-net by Hwang et al. of Carnegie Mellon☆65Updated 4 months ago
- Implementation of a framework for Genie2 in Pytorch☆156Updated 11 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆56Updated 9 months ago
- Making Flux go brrr on GPUs.☆159Updated 5 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆133Updated 8 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆125Updated last year
- Official PyTorch implementation of TokenSet.☆127Updated 9 months ago
- ☆28Updated 2 months ago
- ☆23Updated last year
- The aim of this repository is to test and implement Flow-Matching-based models☆120Updated 11 months ago
- ☆163Updated 2 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆133Updated 3 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆96Updated 10 months ago
- ☆22Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated last year
- ☆171Updated 2 months ago