blairstar / The_Art_of_DPMLinks
An In-depth Analysis of Diffusion Probability Model
☆116Updated 8 months ago
Alternatives and similar repositories for The_Art_of_DPM
Users that are interested in The_Art_of_DPM are comparing it to the libraries listed below
Sorting:
- https://www.shoufachen.com/Awesome-Diffusion-Transformers/☆144Updated last year
- Keras implement of Finite Scalar Quantization☆77Updated last year
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆242Updated 3 months ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 8 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆49Updated 10 months ago
- ☆112Updated 2 years ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无 需额外训练为任意扩散模型支持多语言能力)☆135Updated 5 months ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"☆35Updated last month
- ☆27Updated 3 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Updated last year
- ☆69Updated 2 years ago
- A list for Text-to-Video, Image-to-Video works☆239Updated last month
- [ICML 2025 Spotlight] Direct Discriminative Optimization: Supercharging Diffusion/Autoregressive with GAN-type Discrimination☆85Updated 3 weeks ago
- ☆32Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆244Updated 4 months ago
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆37Updated last year
- pytorch大规模数据读取dataset☆13Updated 3 years ago
- ☆104Updated last year
- An initiative to replicate Sora☆104Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆112Updated last year
- ☆129Updated last year
- 扩散模型算法基础文档、训练、实验、部署等仓库☆39Updated 4 months ago
- [WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"☆43Updated 4 months ago
- ChatSD is designed to make image generation tasks easily☆20Updated 2 years ago
- A replication of Google's VideoPoet model☆12Updated last year
- ☆82Updated last year
- Our 2nd-gen LMM☆33Updated last year
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆78Updated 6 months ago
- The paper collections for the autoregressive models in vision.☆10Updated 4 months ago