SonyResearch / micro_diffusion
Official repository for our work on micro-budget training of large-scale diffusion models.
☆794Updated this week
Alternatives and similar repositories for micro_diffusion:
Users that are interested in micro_diffusion are comparing it to the libraries listed below
- UNet diffusion model in pure CUDA☆596Updated 6 months ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆465Updated 6 months ago
- NanoGPT (124M) in 3.4 minutes☆2,068Updated last week
- Schedule-Free Optimization in PyTorch☆2,061Updated last month
- The Tensor (or Array)☆418Updated 5 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆261Updated 7 months ago
- Train VAE like a boss☆252Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆644Updated this week
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆408Updated last month
- Helpful tools and examples for working with flex-attention☆583Updated this week
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…☆1,774Updated 2 weeks ago
- The Multilayer Perceptron Language Model☆532Updated 5 months ago
- For optimization algorithm research and development.☆484Updated this week
- Train high-quality text-to-image diffusion models in a data & compute efficient manner☆467Updated last month
- A suite of image and video neural tokenizers☆1,478Updated this week
- Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆791Updated last month
- PyTorch native quantization and sparsity for training and inference☆1,753Updated this week
- Code for BLT research paper☆1,314Updated this week
- Text to Image Latent Diffusion using a Transformer core☆158Updated 4 months ago
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a…☆1,257Updated this week
- The Autograd Engine☆550Updated 4 months ago
- Puzzles for learning Triton☆1,300Updated last month
- nanoGPT style version of Llama 3.1☆1,290Updated 5 months ago
- Annotated version of the Mamba paper☆469Updated 10 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆831Updated last month
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,143Updated last month
- EDM2 and Autoguidance -- Official PyTorch implementation☆610Updated last month
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆270Updated 2 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆262Updated 5 months ago
- 94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds☆195Updated last month