cobanov / image-captioningLinks
Image captioning using python and BLIP
☆49Updated 2 years ago
Alternatives and similar repositories for image-captioning
Users that are interested in image-captioning are comparing it to the libraries listed below
Sorting:
- Fine-tuning code for CLIP models☆253Updated 2 months ago
- This project is under development.☆23Updated 2 years ago
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆62Updated 6 months ago
- ☆16Updated 2 years ago
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆53Updated 2 years ago
- Fine tuning OpenAI's CLIP model on Indian Fashion Dataset☆51Updated 2 years ago
- GroundedSAM Base Model plugin for Autodistill☆52Updated last year
- finetune your florence2 model easy☆18Updated last year
- GIT/BLIP/CLIP Caption tool☆140Updated 2 years ago
- This is a A.I Dev Page for English Translations of the original documents by kohya-ss☆105Updated last year
- Training and generation / detection / inference scripts dealing with Yolov8☆68Updated last year
- Templating language for generating prompts for text to image generators such as Stable Diffusion☆147Updated last year
- Huggingface utilities for Ultralytics/YOLOv8☆87Updated last year
- Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help custome…☆60Updated 2 years ago
- BSRGAN-Pip: Packaged version of the BSRGAN repository☆14Updated 2 years ago
- Official PyTorch implementation of Revisiting Image Pyramid Structure for High Resolution Salient Object Detection (ACCV 2022)☆683Updated 4 months ago
- Advanced fine tuning tools for vision models☆226Updated 2 years ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year
- Web UI for Stable Diffusion prompt generation via GPT-2 trained model☆51Updated 2 years ago
- Various training scripts used to train bigasp☆102Updated last month
- LoRA (Low-Rank Adaptation) inspector for Stable Diffusion☆100Updated 3 months ago
- A component that allows you to annotate an image with points and boxes.☆21Updated last year
- ☆321Updated last year
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- ☆22Updated last year
- automatic image inpainting (lama(with refinement) and maskdino)☆41Updated 2 years ago
- This repo is a packaged version of the Yolov9 model.☆89Updated 2 weeks ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Updated 2 years ago
- Implementation of Grounding DINO & Segment Anything, and it allows masking based on prompt, which is useful for programmed inpainting.☆38Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year