qihao067 / CrossFlow
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution'
☆167Updated 2 months ago
Alternatives and similar repositories for CrossFlow
Users that are interested in CrossFlow are comparing it to the libraries listed below
Sorting:
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆102Updated 2 months ago
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆194Updated 3 weeks ago
- Pixel-Space Generative Models☆197Updated last week
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆136Updated 3 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆113Updated 3 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆37Updated 7 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆118Updated 2 months ago
- ☆30Updated 2 months ago
- ☆159Updated 4 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆132Updated last month
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆151Updated last month
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆51Updated 2 weeks ago
- Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆145Updated 3 weeks ago
- Author's Implementation for E-LatentLPIPS☆146Updated 6 months ago
- Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing"☆110Updated last year
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆203Updated 2 weeks ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆274Updated 5 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆111Updated 7 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆95Updated last month
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)☆79Updated 2 months ago
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework".☆154Updated 2 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need☆218Updated 2 months ago
- [ICLR 2024] Code for our paper: GNRI: Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models☆48Updated 2 months ago
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral)☆226Updated 5 months ago
- ☆95Updated last month
- This is the official implementation for ControlVAR.☆107Updated 5 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆48Updated last month
- DDT: Decoupled Diffusion Transformer☆237Updated 3 weeks ago
- ☆193Updated 3 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆118Updated 3 months ago