C0nsumption / Consume-Blip3
XGEN-MM(BLIP3) Autocaptioning Tools
☆16Updated last year
Alternatives and similar repositories for Consume-Blip3
Users that are interested in Consume-Blip3 are comparing it to the libraries listed below
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆29Updated last month
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 5 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆35Updated 8 months ago
- ☆17Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 4 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆48Updated 9 months ago
- Balanced Image Stylization with Style Matching Score☆29Updated 2 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆69Updated 6 months ago
- [AAAI 2025] LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation☆43Updated 5 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆51Updated this week
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Updated 5 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆52Updated last year
- ☆46Updated 7 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆36Updated 4 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆63Updated last year
- ☆131Updated 3 months ago
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆51Updated 6 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆15Updated 7 months ago
- ☆18Updated 3 weeks ago
- 🔥 [CVPR 2024] The official repo for Zero-Painter!☆67Updated last year
- Code for full fine-tuning of the Mochi model with FSDP (and CP)☆27Updated 2 months ago
- Official repository of IDEA-Bench☆35Updated 5 months ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"☆38Updated last month
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆81Updated 11 months ago
- Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024☆105Updated 6 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆113Updated 3 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆107Updated last year
- Pytorch implementation of Towards Consistent and Controllable Image Synthesis for Face Editing☆56Updated last month
- Official pytorch implementation for SingleInsert☆27Updated last year
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆62Updated 3 months ago