mhh0318 / UniD3
☆53Updated 2 years ago
Alternatives and similar repositories for UniD3:
Users that are interested in UniD3 are comparing it to the libraries listed below
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆27Updated 2 years ago
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆54Updated last year
- Simple script to compute CLIP-based scores given a DALL-e trained model.☆30Updated 3 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆37Updated 2 years ago
- ☆43Updated 5 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 2 months ago
- ☆50Updated 2 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆44Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models☆44Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 2 years ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 7 months ago
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models☆165Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 2 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆79Updated 10 months ago
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆156Updated last year
- ☆57Updated last year
- [NeurIPS 2022] (Amortized) distributional control for pre-trained generative models☆119Updated last year
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆38Updated last year
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Updated last year
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆57Updated last year
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆38Updated last year
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆83Updated last year
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆80Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆33Updated 11 months ago
- https://arxiv.org/abs/2209.15162☆49Updated 2 years ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]☆33Updated last year
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform…☆29Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆46Updated 4 months ago
- ☆56Updated 9 months ago