ProGamerGov / VLM-Captioning-Tools
Python scripts to use for captioning images with VLMs
☆34 Updated 3 months ago
Related projects
Alternatives and complementary repositories for VLM-Captioning-Tools
- A Diffusion training toolbox based on diffusers and existing SOTA methods, including Dreambooth, Textual Inversion, LoRA, Custom Diffusion… ☆77 Updated last month
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models ☆108 Updated 5 months ago
- ☆117 Updated 3 weeks ago
- AnimationDiff with train ☆117 Updated 8 months ago
- Official code for CustAny: Customizing Anything from A Single Example ☆38 Updated this week
- ☆75 Updated last year
- Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step ☆156 Updated 4 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆34 Updated last month
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆75 Updated 7 months ago
- InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥 ☆38 Updated 3 months ago
- ☆70 Updated last year
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024) ☆72 Updated this week
- SigLIP-based Aesthetic Score Predictor ☆142 Updated last month
- Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space" ☆44 Updated 7 months ago
- Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generati… ☆24 Updated 8 months ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024) ☆104 Updated 2 weeks ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆137 Updated this week
- More suitable IP-Adapter for the DiT architecture ☆26 Updated 4 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing ☆50 Updated 7 months ago
- ☆42 Updated last month
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models ☆64 Updated 7 months ago
- ☆77 Updated 2 months ago
- ☆86 Updated 9 months ago
- ☆32 Updated 4 months ago
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation) ☆18 Updated 6 months ago
- InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation 🔥 ☆65 Updated 4 months ago
- The implementation of the paper "Improving Sample Quality of Diffusion Models Using Self-Attention Guidance" (ICCV'23) ☆108 Updated 3 months ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers ☆96 Updated last year
- 🔥 [CVPR 2024] The official repo for Zero-Painter! ☆62 Updated 5 months ago
- A retrain of AnimateDiff to be conditional on an init image ☆33 Updated last year