qihao067 / CrossFlow
This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolution'
☆130Updated last week
Alternatives and similar repositories for CrossFlow:
Users that are interested in CrossFlow are comparing it to the libraries listed below
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆110Updated last month
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆123Updated 3 weeks ago
- Author's Implementation for E-LatentLPIPS☆134Updated 3 months ago
- ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆61Updated 4 months ago
- Official PyTorch Implementation for Readout Guidance, CVPR 2024☆136Updated 4 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆95Updated last year
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆36Updated 4 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆78Updated last week
- ☆107Updated 11 months ago
- ☆138Updated 2 months ago
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆86Updated 5 months ago
- Learning Motion from Low-Rank Adaptation☆44Updated 8 months ago
- ☆188Updated last week
- ☆76Updated last year
- [arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆246Updated last month
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆99Updated 7 months ago
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆77Updated 11 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆102Updated 4 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆262Updated 2 months ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation☆114Updated last month
- ☆110Updated 4 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆45Updated 2 months ago
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework".☆111Updated this week
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆102Updated last month
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆114Updated 3 weeks ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆99Updated 6 months ago