luping-liu/LongAlign

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/luping-liu/LongAlign)

luping-liu / LongAlign

The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)

☆83

Alternatives and similar repositories for LongAlign

Users that are interested in LongAlign are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

luping-liu / Detector-Guidance
View on GitHub
The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)
☆20Feb 7, 2024Updated 2 years ago
KwonGihyun / TweedieMix
View on GitHub
Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)
☆62Jan 22, 2025Updated last year
I2-Multimedia-Lab / Magnet
View on GitHub
Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func…
☆31Dec 2, 2024Updated last year
zeyofu / Commonsense-T2I
View on GitHub
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
☆24Aug 13, 2024Updated last year
YangLing0818 / RealCompo
View on GitHub
[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
☆121Nov 14, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
real-absolute-AI / Unnatural_Language
View on GitHub
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆24May 20, 2025Updated last year
Chenfeng1271 / SVDiff
View on GitHub
Streaming Video Diffusion: Online Video Editing with Diffusion Models
☆17Jun 3, 2024Updated 2 years ago
sail-sg / LightTrans
View on GitHub
The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"
☆22Apr 22, 2025Updated last year
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
sail-sg / finetune-fair-diffusion
View on GitHub
Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness
☆47Apr 26, 2024Updated 2 years ago
WeihuangLin / INF-LLaVA
View on GitHub
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆42Aug 4, 2024Updated last year
sail-sg / dice
View on GitHub
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆47Apr 15, 2025Updated last year
itsmag11 / Omegance
View on GitHub
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)
☆52Jan 14, 2026Updated 6 months ago
TIGER-AI-Lab / OmniEdit
View on GitHub
Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]
☆144Jan 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rhfeiyang / Opt-In-Art
View on GitHub
Official implementation of "Opt-In Art: Learning Art Styles Only from Few Examples" (Accepted by NeurIPS 2025)
☆33Nov 30, 2025Updated 7 months ago
CaraJ7 / CoMat
View on GitHub
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
☆169Nov 18, 2024Updated last year
jacklishufan / Reflect-DiT
View on GitHub
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
☆56Aug 16, 2025Updated 11 months ago
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 6 months ago
happyw1nd / MFD
View on GitHub
[ICML 2026] The official implementation of "Mean Flow Distillation: Robust and Stable Distillation for Flow Matching Models".
☆23Jun 14, 2026Updated last month
jimmyxu123 / SELECT
View on GitHub
This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"
☆16Oct 8, 2024Updated last year
jnypark / VideoMamba
View on GitHub
☆27Jun 4, 2024Updated 2 years ago
sdbds / florence2-ft-advanced
View on GitHub
finetune your florence2 model easy
☆21Jul 27, 2024Updated last year
YasminZhang / EBAMA
View on GitHub
[ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M…
☆16May 24, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
louisYen / Gen4Gen
View on GitHub
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
☆110Mar 27, 2026Updated 3 months ago
sail-sg / Meta-Unlearning
View on GitHub
☆35Apr 22, 2025Updated last year
OpenCausaLab / CELLO
View on GitHub
☆22Nov 5, 2024Updated last year
sail-sg / scaling-with-vocab
View on GitHub
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆112Sep 26, 2024Updated last year
gemlab-vt / motionshop
View on GitHub
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
☆26Dec 12, 2024Updated last year
zhaoshitian / LeX-Art
View on GitHub
Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"
☆85Aug 25, 2025Updated 11 months ago
arthur-qiu / FreeTraj
View on GitHub
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
☆114Sep 19, 2025Updated 10 months ago
Nithin-GK / MaxFusion
View on GitHub
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
☆27Nov 2, 2024Updated last year
facebookresearch / unibench
View on GitHub
Python Library to evaluate VLM models' robustness across diverse benchmarks
☆227Jun 30, 2026Updated 3 weeks ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Picsart-AI-Research / Zero-Painter
View on GitHub
🔥 [CVPR 2024] The official repo for Zero-Painter!
☆70Jun 8, 2024Updated 2 years ago
Vchitect / FasterCache
View on GitHub
[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
☆264Dec 27, 2024Updated last year
NJU-PCALab / RAG-Diffusion
View on GitHub
[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
☆622Dec 12, 2025Updated 7 months ago
PardoAlejo / MatchDiffusion
View on GitHub
☆23Mar 16, 2026Updated 4 months ago
sail-sg / AnyDoor
View on GitHub
AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models
☆61Apr 8, 2024Updated 2 years ago
beichenzbc / Long-CLIP
View on GitHub
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
☆901Aug 13, 2024Updated last year
liuxiaoyu1104 / InstanceControl
View on GitHub
[ECCV 2026] Controllable Complex Image Generation without Instance Labeling
☆20Jul 1, 2026Updated 3 weeks ago