cobanov / image-captioning
Image captioning using python and BLIP
☆45Updated last year
Alternatives and similar repositories for image-captioning:
Users that are interested in image-captioning are comparing it to the libraries listed below
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆51Updated last year
- ☆21Updated 8 months ago
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆58Updated 4 months ago
- This project is under development.☆23Updated last year
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Updated last year
- Roboflow Workflows on ComfyUI☆32Updated 5 months ago
- ☆40Updated 11 months ago
- An End-to-End Guide for Learning Stable Diffusion - From Noob to Expert☆37Updated last year
- A notebook-based web UI for DeepFloyd IF☆24Updated 9 months ago
- ☆16Updated last year
- ComfyUI node for fast neural style transfer☆71Updated 6 months ago
- This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creati…☆28Updated 9 months ago
- A "loopback on steroids" type of extension for Stable Diffusion Web UI.☆27Updated this week
- Fine-tuning code for CLIP models☆204Updated 3 months ago
- Training and generation / detection / inference scripts dealing with Yolov8☆57Updated 6 months ago
- Custom nodes for using fal API. Video generation with Kling, Runway, Luma. Image generation with Flux. LLMs and VLMs OpenAI, Claude, Llam…☆92Updated 3 weeks ago
- ☆30Updated last year
- Data research, preparation, and manipulation nodes for model trainers and artists.☆48Updated last week
- Testbed for the fastest SD pipelines☆35Updated last year
- AniPortrait with Gradio: Audio-Driven Synthesis of Photorealistic Portrait Animation☆22Updated 11 months ago
- 📊 Research-focused SDXL training framework exploring novel optimization approaches. Goals include enhanced image quality, training stabi…☆19Updated last month
- finetune your florence2 model easy☆16Updated 8 months ago
- ComfyUI Workflows☆41Updated 3 months ago
- A library to scrape and resize google images, focusing on faces - mainly for machine learning (Stable Diffusion)☆30Updated 2 years ago
- LoRA (Low-Rank Adaptation) inspector for Stable Diffusion☆95Updated 5 months ago
- Apply unlimited masks to unlimited LoRA models☆48Updated last year
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated 10 months ago
- A powerful and user-friendly tool that generates detailed captions for your images☆21Updated 3 months ago
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.☆57Updated 3 months ago