Qiyuan-Ge / PaintMind
Fast and controllable text-to-image model.
☆40 Updated last year
Alternatives and similar repositories for PaintMind:
Users who are interested in PaintMind are comparing it to the libraries listed below.
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆66 Updated 7 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints. ☆111 Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆44 Updated last year
- (wip) Use LAION-AI's CLIP "conditioned prior" to generate CLIP image embeds from CLIP text embeds. ☆27 Updated 2 years ago
- ☆83 Updated last year
- ☆72 Updated last year
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆61 Updated 8 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models ☆79 Updated last year
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation ☆39 Updated 2 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆78 Updated 9 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model' ☆102 Updated 2 months ago
- JAX implementation of ViT-VQGAN ☆80 Updated 2 years ago
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023) ☆37 Updated last year
- Training code for CLIP-FlanT5 ☆22 Updated 6 months ago
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation ☆45 Updated last month
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) ☆81 Updated last month
- ☆43 Updated 4 months ago
- Official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform… ☆29 Updated last year
- ☆98 Updated this week
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization ☆116 Updated this week
- ☆53 Updated last year
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models" ☆165 Updated last year
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023 ☆37 Updated last year
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization ☆121 Updated 7 months ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch ☆64 Updated 2 years ago
- ElasticTok: Adaptive Tokenization for Image and Video ☆49 Updated 2 months ago
- ☆75 Updated 2 months ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch ☆34 Updated last year
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility ☆35 Updated last week
- ☆10 Updated last year