sakharok13 / Aligning-Stable-Diffusion-with-Noise-Conditioned-Perception
☆16 · Updated 6 months ago
Alternatives and similar repositories for Aligning-Stable-Diffusion-with-Noise-Conditioned-Perception:
Users who are interested in Aligning-Stable-Diffusion-with-Noise-Conditioned-Perception are comparing it to the repositories listed below.
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N… ☆50 · Updated last year
- ☆21 · Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆46 · Updated 4 months ago
- Video Diffusion State Space Models ☆19 · Updated 10 months ago
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis ☆24 · Updated this week
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆26 · Updated 9 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆33 · Updated 11 months ago
- ☆19 · Updated last year
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis… ☆44 · Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆41 · Updated last year
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024] ☆33 · Updated last year
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆33 · Updated 3 weeks ago
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation ☆39 · Updated 2 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆79 · Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆62 · Updated 9 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models ☆51 · Updated 6 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 · Updated last year
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation ☆21 · Updated 4 months ago
- Code for the paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models" ☆41 · Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆45 · Updated last year
- ☆45 · Updated this week
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆49 · Updated 4 months ago
- Stable Consistency Tuning: Understanding and Improving Consistency Models ☆16 · Updated 3 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆68 · Updated 8 months ago
- A curated list of papers and resources for text-to-image evaluation. ☆27 · Updated last year
- ORES: Open-vocabulary Responsible Visual Synthesis ☆13 · Updated last year
- (arXiv.2405.18406) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives ☆32 · Updated 3 months ago
- Training code for CLIP-FlanT5 ☆24 · Updated 6 months ago