kjerk / instructblip-pipelineLinks
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
☆33Updated 2 years ago
Alternatives and similar repositories for instructblip-pipeline
Users that are interested in instructblip-pipeline are comparing it to the libraries listed below
Sorting:
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆85Updated last year
- CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models☆21Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆177Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆64Updated last year
- ☆31Updated last year
- ☆206Updated last year
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆139Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆50Updated 8 months ago
- finetune your florence2 model easy☆20Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆75Updated 4 months ago
- official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning (COLM 2024)☆175Updated last year
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆62Updated 7 months ago
- ☆128Updated last month
- ☆30Updated 2 years ago
- ☆64Updated 4 months ago
- ☆24Updated last year
- Useful utilities for huggingface☆21Updated 3 weeks ago
- Training InstructPi2Pix with SDXL.☆18Updated 2 years ago
- DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models☆130Updated last month
- A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.☆157Updated 9 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆47Updated last year
- ☆90Updated last year
- ☆13Updated last year
- ☆61Updated 2 years ago
- ☆60Updated last year
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆31Updated 5 months ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆173Updated 2 years ago
- BLIP2 captioning tool as an extension of AUTOMATIC's WebUI☆59Updated 2 years ago
- XGEN-MM(BLIP3) Autocaptioning Tools☆16Updated last year