Epiphqny / PAR
The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/
☆103Updated last week
Alternatives and similar repositories for PAR:
Users that are interested in PAR are comparing it to the libraries listed below
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆118Updated last week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 5 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆65Updated last month
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆66Updated 2 weeks ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆98Updated 6 months ago
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆106Updated last month
- [ECCV2024] PartCraft: Crafting Creative Objects by Parts☆85Updated 3 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆122Updated 3 months ago
- ☆179Updated 3 weeks ago
- NOVA: Autoregressive Video Generation without Vector Quantization☆299Updated this week
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆103Updated 7 months ago
- ☆124Updated 3 months ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆77Updated this week
- ☆121Updated 3 weeks ago
- ☆53Updated last month
- [arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆185Updated this week
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"☆64Updated 6 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆79Updated 8 months ago
- ☆220Updated 5 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆102Updated last month
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆95Updated 10 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆133Updated 2 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆45Updated 2 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆131Updated 6 months ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation☆109Updated 2 weeks ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated last month
- Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"☆97Updated 10 months ago
- Official code for "ControlAR: Controllable Image Generation with Autoregressive Models"☆170Updated 2 weeks ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆105Updated 2 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆66Updated last month