thunlp / ACDiT
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
☆22 Updated last week
Alternatives and similar repositories for ACDiT:
Users who are interested in ACDiT are comparing it to the repositories listed below.
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆59 Updated 2 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆55 Updated 2 months ago
- Codebase for the paper "Elucidating the Design Space of Language Models for Image Generation" ☆45 Updated last month
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA models, now implemented with the base model AudioLDM2. ☆33 Updated 5 months ago
- Codebase for rectified flow ☆30 Updated last week
- An official PyTorch implementation of the AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching" ☆34 Updated 9 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NeurIPS 2024] ☆75 Updated last month
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation ☆77 Updated this week
- [ICLR 2024] Official code for the paper "Elucidating the Exposure Bias in Diffusion Models" ☆40 Updated 7 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation ☆38 Updated last month
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation ☆62 Updated 7 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens" ☆36 Updated 3 weeks ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling. ☆31 Updated 6 months ago
- Official code for the Diff-Instruct algorithm for one-step diffusion distillation ☆63 Updated this week
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆26 Updated 8 months ago
- Code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models" ☆39 Updated 4 months ago
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation ☆44 Updated 2 weeks ago
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆54 Updated last year
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows ☆42 Updated this week
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024 ☆92 Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆77 Updated 9 months ago
- Blending Custom Photos with Video Diffusion Transformers ☆20 Updated this week
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆26 Updated 2 months ago