Epiphqny / PAR

The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/

☆110

Alternatives and similar repositories for PAR:

Users that are interested in PAR are comparing it to the libraries listed below

qihao067 / CrossFlow
This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…
☆130Updated last week
causalfusion / causalfusion
☆138Updated 2 months ago
NVlabs / T-Stitch
[ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…
☆99Updated 11 months ago
Litalby1 / make-it-count
Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"
☆66Updated 8 months ago
UCSC-VLAA / HQ-Edit
[ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
☆86Updated 10 months ago
SPRIGHT-T2I / SPRIGHT
[ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"
☆99Updated 7 months ago
fusiming3 / MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆83Updated 7 months ago
jianzongwu / MotionBooth
[NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆126Updated 4 months ago
TIGER-AI-Lab / OmniEdit
Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]
☆79Updated 3 weeks ago
ali-vilab / FreeScale
Code for FreeScale, a tuning-free method for higher-resolution visual generation
☆114Updated last month
microsoft / Reducio-VAE
☆188Updated last week
PKU-YuanGroup / WF-VAE
Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
☆114Updated 3 weeks ago
NJU-PCALab / OpenVid-1M
[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
☆241Updated this week
showlab / T2VScore
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆79Updated 10 months ago
AdaCache-DiT / AdaCache
Adaptive Caching for Faster Video Generation with Diffusion Transformers
☆142Updated 3 months ago
Adamdad / vico
Vico: Compositional Video Generation as Flow Equalization
☆57Updated 3 months ago
huang-yh / Owl
☆47Updated 2 months ago
desaixie / pa_vdm
ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
☆61Updated 4 months ago
weijiawu / ParaDiffusion
Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'
☆102Updated 2 months ago
czg1225 / CoDe
CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆75Updated 3 weeks ago
THUDM / VisionReward
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆140Updated this week
wutong16 / FiVA
[ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"
☆66Updated last month
Hritikbansal / videophy
Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics
☆78Updated last week
yhZhai / mcm
[NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
☆58Updated 3 months ago
SilentView / LVD-2M
[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"
☆45Updated 4 months ago
ali-vilab / ChatDiT
☆39Updated last month
feizc / Ingredients
Blending Custom Photos with Video Diffusion Transformers
☆43Updated 3 weeks ago
kamwoh / partcraft
[ECCV2024] PartCraft: Crafting Creative Objects by Parts
☆87Updated 3 weeks ago
tyshiwo1 / Accelerating-T2I-AR-with-SJD
Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
☆28Updated 3 months ago