Qiyuan-Ge / PaintMind
Fast and controllable text-to-image model.
☆40Updated last year
Related projects ⓘ
Alternatives and complementary repositories for PaintMind
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆27Updated 3 weeks ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆76Updated 6 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆61Updated 5 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆94Updated 6 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆60Updated 6 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆30Updated 4 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆32Updated 2 weeks ago
- ☆71Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆32Updated 8 months ago
- ☆78Updated 10 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆78Updated 7 months ago
- Official PyTorch Implementation of "Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models"☆31Updated last month
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆28Updated 2 years ago
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation☆38Updated last year
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- [arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆84Updated 5 months ago
- Official code base for paper EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guid…☆35Updated last month
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆26Updated 8 months ago
- ☆44Updated last month
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆75Updated 4 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆105Updated 4 months ago
- ☆10Updated last year
- ☆15Updated 3 months ago
- Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition (ICLR 2024)☆27Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆84Updated 4 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆78Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation☆36Updated 3 months ago
- ☆48Updated last year
- ☆102Updated 4 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆75Updated 7 months ago