showlab / D-ARLinks
the official repo for "D-AR: Diffusion via Autoregressive Models"
β30Updated last week
Alternatives and similar repositories for D-AR
Users that are interested in D-AR are comparing it to the libraries listed below
Sorting:
- πPytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"β27Updated 7 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"β47Updated 6 months ago
- β23Updated 11 months ago
- Native-resolution diffusion Transformerβ43Updated this week
- β33Updated 4 months ago
- The official repo of continuous speculative decodingβ26Updated 2 months ago
- π₯ Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"β73Updated last week
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generationβ74Updated 3 months ago
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"β41Updated 2 months ago
- [ICLR 2025] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffuβ¦β19Updated 2 weeks ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"β41Updated 9 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025β11Updated 2 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generationβ30Updated 6 months ago
- Boosting Generative Image Modeling via Joint Image-Feature Synthesisβ34Updated last month
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.β47Updated 7 months ago
- β9Updated last year
- [CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.β29Updated 2 months ago
- Autoregressive Image Generation with Randomized Parallel Decodingβ63Updated 2 months ago
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcingβ55Updated last week
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initializationβ21Updated last month
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modelingβ31Updated 3 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editinβ¦β30Updated 3 weeks ago
- Code for paper "Principal Components" Enable A New Language of Imagesβ41Updated last month
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'β17Updated 7 months ago
- β21Updated 5 months ago
- Video Diffusion State Space Modelsβ19Updated last year
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generationβ62Updated 3 months ago
- Unifying Specialized Visual Encoders for Video Language Modelsβ18Updated last week
- β22Updated 7 months ago
- A curated list of papers and resources for text-to-image evaluation.β29Updated last year