llm-conditioned-diffusion/OmniDiffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/llm-conditioned-diffusion/OmniDiffusion)

llm-conditioned-diffusion / OmniDiffusion

☆14

Alternatives and similar repositories for OmniDiffusion

Users that are interested in OmniDiffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SAIS-FUXI / EvalAlign
View on GitHub
☆19Oct 23, 2024Updated last year
SAIS-FUXI / IPO
View on GitHub
☆58May 6, 2025Updated last year
SAIS-FUXI / VidGen
View on GitHub
☆68Aug 16, 2024Updated last year
kobeshegu / DiverseDiT
View on GitHub
[CVPR-2026] DiverseDiT: Towards Diverse Representation Learning in Diffusion Transformers
☆20Mar 26, 2026Updated 3 months ago
Shopee-MUG / MUG-U
View on GitHub
一个强大的多模态大语言模型（MLLM），支持文本、图像、视频等多模态输入，具备强大的理解、推理和生成能力。
☆23Mar 19, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SAIS-FUXI / Omni-Video
View on GitHub
☆157Feb 28, 2026Updated 4 months ago
CodeGoat24 / LiFT
View on GitHub
Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.
☆85May 4, 2025Updated last year
Fr0zenCrane / Cockatiel
View on GitHub
Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption
☆38May 21, 2025Updated last year
zcq15 / SPDET
View on GitHub
☆13Jun 1, 2023Updated 3 years ago
mira-ai-lab / MUSIC-AVQA-R
View on GitHub
☆13May 21, 2024Updated 2 years ago
VisionChengzhuo / CoF-T2I
View on GitHub
Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.
☆39Jan 16, 2026Updated 6 months ago
wbw520 / RTFP
View on GitHub
Improving Facade Parsing with Vision Transformers and Line Integration
☆24Mar 13, 2024Updated 2 years ago
tmallet / expo-proximity
View on GitHub
Provides access to the system's proximity sensor.
☆26Jan 1, 2025Updated last year
human-analysis / FairerCLIP
View on GitHub
Official code for the paper "FairerCLIP: Debiasing CLIP’s Zero-Shot Predictions using Functions in RKHSs".
☆16Oct 14, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
TuShArBhArDwA / Spotlight
View on GitHub
🎥 AI-powered webinar platform with live streaming, smart breakout rooms, and autonomous sales agents.
☆12Jun 21, 2025Updated last year
aliyun / S2net
View on GitHub
☆15Jul 13, 2023Updated 3 years ago
aimagelab / MAD
View on GitHub
Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att…
☆15Jul 9, 2025Updated last year
1069066484 / PanoSwinTransformerObjectDetection
View on GitHub
☆18Jun 9, 2023Updated 3 years ago
meder411 / Spherical-Package
View on GitHub
The backend code for my projects associated with spherical images
☆17Oct 12, 2021Updated 4 years ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
MinglangQiao / awesome-salient-object-ranking
View on GitHub
A curated list of awesome resources for salient object ranking (SOR)
☆17Sep 28, 2025Updated 9 months ago
beepkh / WiseEdit
View on GitHub
☆16Dec 25, 2025Updated 6 months ago
zrealli / LDGen
View on GitHub
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation
☆38Mar 3, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ericsujw / LGPN-net
View on GitHub
This is an official implementation of Layout-guided Indoor Panorama Inpainting with Plane-aware Normalization.
☆15Apr 27, 2023Updated 3 years ago
practical-dreamer / vicuna_to_alpacan
View on GitHub
Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer
☆12Jun 21, 2023Updated 3 years ago
prisma / prisma-edge-functions
View on GitHub
☆18Dec 7, 2023Updated 2 years ago
radzionc / deploy-notebook
View on GitHub
Deploy Jupyter Notebook to AWS Lambda
☆16Nov 18, 2020Updated 5 years ago
haoai-1997 / Elite360D
View on GitHub
☆15Dec 10, 2024Updated last year
neondatabase / preview-branches-with-fly
View on GitHub
A Neon branch for every Fly Preview app
☆26Feb 2, 2026Updated 5 months ago
chma1024 / DensePASS
View on GitHub
☆15Feb 14, 2022Updated 4 years ago
Fr0zenCrane / UniCoT
View on GitHub
[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
☆233May 31, 2026Updated last month
mdyao / HDR-BiTNet
View on GitHub
[TMM 2023] Official Implementation of "Bidirectional Translation Between UHD-HDR and HD-SDR Videos"
☆10Aug 8, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
pjreddie / RayTracer
View on GitHub
☆23Feb 11, 2020Updated 6 years ago
runnanchen / PanoSLAM
View on GitHub
☆17Dec 31, 2024Updated last year
BlueDyee / TF-TI2I
View on GitHub
(ICCV'25) TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models (Au…
☆16Aug 22, 2025Updated 11 months ago
sandflow / hdr4png
View on GitHub
Adds ITU BT.2100 PQ signaling to PNG images
☆15Sep 11, 2017Updated 8 years ago
DreamSoul-AI / ColDA
View on GitHub
Collaborative Data Analysis for All
☆19Jun 15, 2023Updated 3 years ago
hgropper / Automated-Amazon-Purchases
View on GitHub
At the start of the coronavirus many people were scrambling to gather computer parts. As a result the prices of GPU's (Graphics Processin…
☆20Dec 10, 2022Updated 3 years ago
Espere-1119-Song / Video-MMLU
View on GitHub
A Massive Multi-Discipline Lecture Understanding Benchmark
☆34Apr 20, 2026Updated 3 months ago