VainF / Remix-DiT
☆16Updated 4 months ago
Alternatives and similar repositories for Remix-DiT:
Users that are interested in Remix-DiT are comparing it to the libraries listed below
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 10 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆40Updated 9 months ago
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆18Updated 5 months ago
- The official repo of continuous speculative decoding☆24Updated last month
- ☆13Updated last month
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- ☆70Updated 5 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 5 months ago
- ☆27Updated last month
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 5 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- ☆22Updated 10 months ago
- Stable Consistency Tuning: Understanding and Improving Consistency models☆16Updated 5 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆33Updated last month
- ☆53Updated last year
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 5 months ago
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆37Updated last year
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆32Updated last year
- ☆45Updated last year
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆41Updated 8 months ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆46Updated last month
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆33Updated last month
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 2 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆46Updated 4 months ago
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆19Updated last month
- Video Diffusion State Space Models☆19Updated last year
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆97Updated 3 weeks ago
- ☆30Updated last month
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆19Updated 6 months ago
- TerDiT: Ternary Diffusion Models with Transformers☆69Updated 10 months ago