The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)
☆82Apr 23, 2025Updated last year
Alternatives and similar repositories for LongAlign
Users that are interested in LongAlign are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)☆20Feb 7, 2024Updated 2 years ago
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 11 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆121Nov 14, 2024Updated last year
- ☆20Apr 16, 2025Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆62Jan 22, 2025Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆118Sep 26, 2024Updated last year
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"☆22Apr 22, 2025Updated last year
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆18Jun 3, 2024Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated 3 months ago
- Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness☆48Apr 26, 2024Updated 2 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆169Nov 18, 2024Updated last year
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]☆145Jan 27, 2025Updated last year
- Official implementation of "Opt-In Art: Learning Art Styles Only from Few Examples" (Accepted by NeurIPS 2025)☆33Nov 30, 2025Updated 5 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆54Jul 15, 2025Updated 9 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 8 months ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 7 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆110Mar 27, 2026Updated last month
- ☆27Jun 4, 2024Updated last year
- ☆22Nov 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M…☆15May 24, 2025Updated 11 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 5 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆113Sep 19, 2025Updated 7 months ago
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"☆16Oct 8, 2024Updated last year
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Dec 12, 2024Updated last year
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆32Oct 2, 2025Updated 7 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆83Aug 25, 2025Updated 8 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year
- 🔥 [CVPR 2024] The official repo for Zero-Painter!☆70Jun 8, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆619Dec 12, 2025Updated 4 months ago
- [ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"☆897Aug 13, 2024Updated last year
- A Powerful LoRA key converter for ComfyUI☆28Nov 17, 2025Updated 5 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated last year
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆63Jan 5, 2026Updated 4 months ago
- Python Library to evaluate VLM models' robustness across diverse benchmarks☆223Oct 20, 2025Updated 6 months ago
- [ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆80Mar 31, 2026Updated last month