qihao067 / CrossFlow
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution'
☆164 · Updated last month
Alternatives and similar repositories for CrossFlow:
Users who are interested in CrossFlow are comparing it to the repositories listed below.
- Pixel-Space Generative Models ☆178 · Updated last week
- EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆93 · Updated last month
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆149 · Updated last month
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization ☆135 · Updated 2 months ago
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers ☆143 · Updated last week
- Official Implementation for Diffusion Models Without Classifier-free Guidance ☆111 · Updated 2 months ago
- Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing" ☆111 · Updated last year
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models ☆37 · Updated 6 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆110 · Updated 2 months ago
- ☆157 · Updated 4 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆183 · Updated last month
- Author's Implementation for E-LatentLPIPS ☆141 · Updated 5 months ago
- ☆31 · Updated last month
- ☆87 · Updated 3 weeks ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models ☆274 · Updated 4 months ago
- DDT: Decoupled Diffusion Transformer ☆201 · Updated last week
- ☆114 · Updated 6 months ago
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation) ☆74 · Updated last month
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need ☆214 · Updated last month
- [ICLR 2024] Code for our paper: GNRI: Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models ☆44 · Updated last month
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆109 · Updated 6 months ago
- Subjects200K dataset ☆107 · Updated 3 months ago
- ☆192 · Updated 2 months ago
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral) ☆227 · Updated 4 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a… ☆87 · Updated 3 weeks ago
- Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆130 · Updated last week
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu ☆48 · Updated 4 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆99 · Updated 2 months ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model ☆134 · Updated last week
- ☆177 · Updated 2 months ago