nousr / dream-bench
A tool for benchmarking image generation models.
☆32 · Updated 2 years ago
Alternatives and similar repositories for dream-bench
Users who are interested in dream-bench are comparing it to the repositories listed below:
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion ☆99 · Updated 2 years ago
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 · Updated 2 years ago
- The implementation for Accelerating Guided Diffusion Sampling with Splitting Numerical Methods (2023) ☆48 · Updated 2 years ago
- ☆73 · Updated 2 years ago
- ☆29 · Updated 2 years ago
- Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023) ☆37 · Updated last year
- ☆21 · Updated 2 years ago
- Training InstructPix2Pix with SDXL. ☆18 · Updated last year
- ☆108 · Updated 2 years ago
- Generate images from an initial frame and text ☆37 · Updated last year
- ☆45 · Updated 9 months ago
- Adaptation of Stable Diffusion with extra prompt guidance from images... An attempt at making the most flexible pipeline that will allow … ☆47 · Updated 2 years ago
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested. ☆55 · Updated last year
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified … ☆68 · Updated 5 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆63 · Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆75 · Updated 11 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆41 · Updated last year
- Colab implementation of Google's null-text inversion. ☆38 · Updated 2 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models" ☆14 · Updated 6 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆34 · Updated last year
- Un-*** 50 billions multimodality dataset ☆24 · Updated 2 years ago
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress) ☆36 · Updated 3 years ago
- WIP Pytorch code for stably training single-step, mode-dropping, deterministic autoencoders ☆26 · Updated last year
- ☆73 · Updated last year
- ☆21 · Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 · Updated 2 years ago
- A retrain of AnimateDiff to be conditional on an init image ☆34 · Updated last year
- Animatediff implementation. Includes a ControlNet pipeline. ☆19 · Updated last year
- Official code for SeMani (CVPR 2020 oral and Journal extension) ☆23 · Updated last year
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation ☆41 · Updated 2 years ago