jacklishufan/OmniFlows

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jacklishufan/OmniFlows)

jacklishufan / OmniFlows

The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows

☆133

Alternatives and similar repositories for OmniFlows

Users that are interested in OmniFlows are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YangLing0818 / consistency_flow_matching
View on GitHub
Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"
☆270Jan 17, 2025Updated last year
cloneofsimo / ptar
View on GitHub
☆13Jun 3, 2024Updated 2 years ago
lzw-lzw / UnifiedMLLM
View on GitHub
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
☆22Aug 5, 2024Updated last year
NVlabs / QLIP
View on GitHub
[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation
☆97Mar 1, 2025Updated last year
chenllliang / DreamEngine
View on GitHub
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆123Mar 4, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
frankenliu / LOAE
View on GitHub
☆10Sep 25, 2024Updated last year
XianfengWu01 / LightGen
View on GitHub
An Efficient Text-to-Image Generation Pretrain Pipeline
☆132Apr 18, 2025Updated last year
ddw2AIGROUP2CQUPT / Face-MakeUp
View on GitHub
Face-MakeUp (SD1.5): Multimodal Facial Prompts for Text-to-Image Generation （ECAI-2025）
☆26Jan 19, 2025Updated last year
lodestone-rock / torchastic
View on GitHub
stochastic bfloat16 based optimizer library
☆22Dec 4, 2024Updated last year
End2End-Diffusion / REPA-E
View on GitHub
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆511Dec 6, 2025Updated 7 months ago
jianzongwu / MotionBooth
View on GitHub
[NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆138Oct 8, 2024Updated last year
lysanderism / TimeAudio
View on GitHub
The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…
☆30Nov 18, 2025Updated 7 months ago
G-U-N / Rectified-Diffusion
View on GitHub
[ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need
☆251Mar 11, 2025Updated last year
pOpsPaper / pOps
View on GitHub
Official implementation for "pOps: Photo-Inspired Diffusion Operators"
☆86Jul 23, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ChocoWu / Any2Caption
View on GitHub
This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation
☆49Apr 3, 2025Updated last year
ML-GSAI / eMIGM
View on GitHub
Official PyTorch implementation for "Effective and Efficient Masked Image Generation Models"
☆34Apr 8, 2025Updated last year
DAMO-NLP-SG / DiGIT
View on GitHub
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆78Oct 31, 2024Updated last year
tzco / Diffusion-wo-CFG
View on GitHub
Official Implementation for Diffusion Models Without Classifier-free Guidance
☆175Feb 18, 2025Updated last year
qihao067 / CrossFlow
View on GitHub
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…
☆342Jun 8, 2025Updated last year
Shakker-Labs / RepText
View on GitHub
RepText: Rendering Visual Text via Replicating 🔥
☆139Jun 7, 2025Updated last year
Huage001 / LinFusion
View on GitHub
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
☆317Dec 23, 2024Updated last year
gwh22 / LAFMA
View on GitHub
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)
☆44Jun 13, 2024Updated 2 years ago
haoningwu3639 / MegaFusion
View on GitHub
[WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
☆101Apr 17, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fenfenfenfan / VMix
View on GitHub
Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
☆191Dec 31, 2024Updated last year
ByteVisionLab / TokenFlow
View on GitHub
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
☆464Aug 8, 2025Updated 11 months ago
jh-cha-prml / JELLY
View on GitHub
Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"
☆14Nov 5, 2024Updated last year
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,503Dec 16, 2025Updated 6 months ago
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆526Nov 14, 2025Updated 7 months ago
opooladz / Preconditioned-Stochastic-Gradient-Descent
View on GitHub
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14Oct 7, 2024Updated last year
Eureka-Maggie / MIGE
View on GitHub
Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing
☆72Jul 13, 2025Updated 11 months ago
feifeiobama / RectifID
View on GitHub
[NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
☆129Oct 13, 2024Updated last year
Dorniwang / UniVerse-1-code
View on GitHub
The official UniVerse-1 code.
☆129Oct 13, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
liujin112 / ZePo
View on GitHub
[ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"
☆43Aug 22, 2024Updated last year
csguoh / ReFIR
View on GitHub
[NeurIPS2024] Overcome hallucination of diffusion restoration models.
☆66Apr 14, 2025Updated last year
EnVision-Research / ComfyMind
View on GitHub
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
☆123Sep 20, 2025Updated 9 months ago
xiquan-li / MeanAudio
View on GitHub
[ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
☆141Sep 2, 2025Updated 10 months ago
mapo-t2i / mapo
View on GitHub
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
☆82Jun 11, 2024Updated 2 years ago
Westlake-AGI-Lab / StyleStudio
View on GitHub
[CVPR 2025] Official implementation of StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
☆171Nov 13, 2025Updated 7 months ago
hywang66 / LARP
View on GitHub
Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
☆106Feb 11, 2025Updated last year