kjerk / instructblip-pipelineLinks
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
☆31Updated last year
Alternatives and similar repositories for instructblip-pipeline
Users that are interested in instructblip-pipeline are comparing it to the libraries listed below
Sorting:
- CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models☆20Updated 8 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆60Updated 2 months ago
- Python scripts to use for captioning images with VLMs☆40Updated last month
- finetune your florence2 model easy☆20Updated 10 months ago
- (IA)^3 for Stable Diffusion☆34Updated 2 years ago
- ☆30Updated last year
- ☆41Updated last year
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Updated last year
- A Diffusion training toolbox based on diffusers and existing SOTA methods, including Dreambooth, Texual Inversion, LoRA, Custom Diffusion…☆79Updated 8 months ago
- Useful utilities for huggingface☆21Updated last month
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- Pytorch implementation of Towards Consistent and Controllable Image Synthesis for Face Editing☆55Updated last month
- A notebook-based web UI for DeepFloyd IF☆25Updated last year
- ☆128Updated 8 months ago
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆75Updated 4 months ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆15Updated 5 months ago
- ☆61Updated last year
- [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆133Updated 11 months ago
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a…☆79Updated 7 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆78Updated 11 months ago
- sd3 dreambooth lora training book, adapted from the diffusers doc☆45Updated 11 months ago
- BLIP2 captioning tool as an extension of AUTOMATIC's WebUI☆60Updated 2 years ago
- Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning☆35Updated last year
- ☆91Updated last year
- Video2Video Framework for ComfyUI☆61Updated 9 months ago
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- 🔥 [CVPR 2024] The official repo for Zero-Painter!☆67Updated last year
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.☆59Updated 6 months ago