luping-liu / LongAlign
The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)
☆57Updated last month
Related projects ⓘ
Alternatives and complementary repositories for LongAlign
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆41Updated 3 weeks ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆75Updated 7 months ago
- Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆60Updated last month
- Vico: Compositional Video Generation as Flow Equalization☆52Updated this week
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆44Updated 2 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆63Updated last week
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆34Updated last month
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"☆81Updated 3 weeks ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆60Updated 6 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"☆61Updated 5 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆91Updated 2 weeks ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆61Updated 5 months ago
- ☆78Updated 3 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆38Updated 3 months ago
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆33Updated 4 months ago
- ☆40Updated 11 months ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆33Updated 4 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆44Updated 3 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 6 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆61Updated 5 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆77Updated 7 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆110Updated 2 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆40Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆84Updated 4 months ago
- Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space"☆44Updated 7 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆122Updated 5 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆31Updated last week
- [ECCV2024] PartCraft: Crafting Creative Objects by Parts☆82Updated last month
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆84Updated 7 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆102Updated 6 months ago