cobanov / image-captioning
Image captioning using python and BLIP
☆49Updated 2 years ago
Alternatives and similar repositories for image-captioning
Users that are interested in image-captioning are comparing it to the libraries listed below
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆53Updated 2 years ago
- BSRGAN-Pip: Packaged version of the BSRGAN repository☆14Updated 2 years ago
- Fine-tuning code for CLIP models☆250Updated last month
- ☆16Updated 2 years ago
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆62Updated 5 months ago
- This is an A.I Dev Page for English translations of the original documents by kohya-ss☆105Updated last year
- GIT/BLIP/CLIP Caption tool☆140Updated 2 years ago
- LoRA (Low-Rank Adaptation) inspector for Stable Diffusion☆101Updated 2 months ago
- This project is under development.☆23Updated 2 years ago
- Training and generation / detection / inference scripts dealing with Yolov8☆67Updated last year
- Video Diffusion WebUI: Text2Video + Image2Video + Video2Video WebUI☆66Updated last year
- A gradio web UI demo for Stable Diffusion XL 1.0, with refiner and MultiGPU support☆280Updated last year
- ☆31Updated 2 years ago
- A library to scrape and resize google images, focusing on faces - mainly for machine learning (Stable Diffusion)☆30Updated 2 years ago
- CPU version of InstantID☆58Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆177Updated last year
- Prompt Generator for Stable Diffusion/Midjourney on GPT-2 models☆95Updated 2 years ago
- ☆27Updated 2 years ago
- Fine-tuning of diffusion models☆99Updated 2 years ago
- This is a wrapper of rem_bg for auto1111's stable diffusion gui. It can do clothing segmentation, background removal, and background mask…☆79Updated last year
- Some tips on using stable diffusion inpainting with diffusers☆13Updated 2 years ago
- Awesome repo for ControlNet☆97Updated 2 years ago
- SAM + CLIP + DIFFUSION to edit objects in images using plain text☆15Updated 2 years ago
- ☆119Updated last year
- Pre-Rendered Regularization Images for use with fine-tuning, especially for the current implementation of "Dreambooth"☆38Updated 2 years ago
- WebUI extension for ControlNet, supports LoRA version of ControlNet☆109Updated 2 years ago
- ☆96Updated last year
- ☆437Updated last year
- Generate long weighted prompt embeddings for Stable Diffusion☆134Updated 4 months ago
- Embedding editor extension for web ui☆74Updated last year