Epiphqny / PAR
The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/
☆110Updated last month
Alternatives and similar repositories for PAR:
Users who are interested in PAR are comparing it to the libraries listed below.
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆130Updated last week
- [ICLR 2025] Official PyTorch implementation of the paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆99Updated 11 months ago
- Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"☆66Updated 8 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆86Updated 10 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆99Updated 7 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 7 months ago
- [NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆126Updated 4 months ago
- [ICLR 2025] Official repo for the paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆79Updated 3 weeks ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation☆114Updated last month
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆114Updated 3 weeks ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆241Updated this week
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆79Updated 10 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆142Updated 3 months ago
- Vico: Compositional Video Generation as Flow Equalization☆57Updated 3 months ago
- Code for the arXiv paper "Progressive Autoregressive Video Diffusion Models": https://arxiv.org/abs/2410.08151☆61Updated 4 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆102Updated 2 months ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated 3 weeks ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆140Updated this week
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆66Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆78Updated last week
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆58Updated 3 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆45Updated 4 months ago
- Blending Custom Photos with Video Diffusion Transformers☆43Updated 3 weeks ago
- [ECCV 2024] PartCraft: Crafting Creative Objects by Parts☆87Updated 3 weeks ago
- Implementation of "Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding"☆28Updated 3 months ago