victorchall/vlm-caption

Multi-turn VLM bulk captioning using your API service

☆39

Alternatives and similar repositories for vlm-caption

Users who are interested in vlm-caption are comparing it to the libraries listed below.

Merserk / Caption-Creator
Caption Creator is a fast and portable tool for generating high-quality image captions and tags - ideal for custom dataset creation. Work…
☆30 · Jun 25, 2026 · Updated last month
2dameneko / ide-cap-chan
ide-cap-chan is a utility for batch image captioning with natural language using various VL models
☆14 · May 8, 2026 · Updated 2 months ago
zeeoale / PromptCreatorV2
🔮 A powerful and stylish Prompt Generator powered by OpenAI and Python. Includes a built-in JSON editor, modular prompt libraries, and f…
☆20 · Jul 12, 2025 · Updated last year
neph1 / finetrainers-ui
Gradio UI for training video models using finetrainers
☆35 · Apr 18, 2025 · Updated last year
Merserk / sd-webui-forge-universal-portable
A universal, portable installer and launcher for Stable Diffusion WebUI Forge (Classic & Neo).
☆19 · Jul 19, 2026 · Updated last week
demibit / stable-toolkit
Fully local image viewer alternative for Stable Diffusion
☆56 · Feb 11, 2023 · Updated 3 years ago
bmaltais / kohya_diffusers_fine_tuning
☆16 · Dec 20, 2022 · Updated 3 years ago
Pikselkroken / pixlstash
PixlStash helps you find things in an image library that's gotten out of hand. It imports and tags your images automatically, then lets y…
☆78 · Updated this week
modl-org / modl
Local-first AI image generation toolkit. Pull models, train LoRAs, generate images. One CLI, no glue code.
☆25 · Updated this week
ThereforeGames / blora_for_kohya
Tools needed for training B-LoRA method with sd-scripts.
☆53 · Aug 7, 2024 · Updated last year
tritant / ComfyUI_Layers_Utility
☆70 · Oct 7, 2025 · Updated 9 months ago
lazniak / LiquidTime-Interpolation
LiquidTime is a simple yet powerful frame interpolation node for ComfyUI. Just input your sequence and desired frame count - the node han…
☆13 · Apr 3, 2025 · Updated last year
jhc13 / taggui
Tag manager and captioner for image datasets
☆1,335 · Oct 11, 2025 · Updated 9 months ago
AlekPet / Fooocus_Extensions_AlekPet
Extensions for Fooocus
☆15 · Sep 5, 2024 · Updated last year
bbc-mc / sdweb-eagle-transfer
Send images to Eagle with PNGinfo from directory. Extension for Stable Diffusion UI by AUTOMATIC1111
☆12 · Dec 13, 2022 · Updated 3 years ago
dawidope / PowerLink
NTFS link tool for Windows — deduplicate identical files in place with hardlinks, clone directories without copying data, create director…
☆19 · Jun 17, 2026 · Updated last month
Ltamann / ComfyUI-TBG-ETUR
TBG Enhanced Tiled Upscaler and Refiner upscales up to 200MP with precise control. It features dual-model processing (structure + detail)…
☆146 · Jul 1, 2026 · Updated 3 weeks ago
camenduru / GVHMR-jupyter
☆19 · Dec 8, 2024 · Updated last year
bbc-mc / sdweb-xyplus
Extension/Script for Stable Diffusion UI by AUTOMATIC1111 https://github.com/AUTOMATIC1111/stable-diffusion-webui
☆17 · Dec 19, 2022 · Updated 3 years ago
pipinstallyp / minigpt4-batch
Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!
☆18 · Jul 20, 2023 · Updated 3 years ago
Gaurox / AI-Metadata-Inspector
Portable Windows tool to extract AI generation metadata and quickly copy prompts from image and video files via right-click.
☆23 · Jun 7, 2026 · Updated last month
Talmendo / blip2-for-sd
☆30 · Aug 28, 2023 · Updated 2 years ago
Iniquitatis / sd-webui-temporal
A "loopback on steroids" type of extension for Stable Diffusion Web UI.
☆31 · Oct 10, 2025 · Updated 9 months ago
if-ai / IF-Animation-Workflows
A series of ComfyUI workflows that work together to create and repurpose animation
☆40 · Aug 10, 2025 · Updated 11 months ago
lachhabw / Image-Captioning-Extension-for-LM-Studio
LM Studio extension for automatic image captioning.
☆14 · Feb 25, 2024 · Updated 2 years ago
chri002 / ComfyUI_depthMapOperation
☆15 · May 27, 2025 · Updated last year
GizmoR13 / PG-Nodes
Custom nodes for ComfyUI
☆16 · Oct 10, 2025 · Updated 9 months ago
vavo / TagPilot
Privacy-first, powerful browser-based tool for tagging, captioning, cropping and managing training datasets for Stable Diffusion's LoRA …
☆40 · Updated this week
GeekatplayStudio / Image-Express
☆96 · Updated this week
ExoFi-Labs / Nexface
☆55 · Jun 24, 2025 · Updated last year
Magirad / Flux_ID_Adjuster_V2
A node created for getting identity consistency for flux.2 klein 9b model.
☆17 · May 31, 2026 · Updated last month
RamonGuthrie / ComfyUI-RBG-LoraConverter
A powerful LoRA key converter for ComfyUI
☆29 · Nov 17, 2025 · Updated 8 months ago
SGUN-father / comfyui-controlfoley
Charlatan
☆15 · May 1, 2026 · Updated 2 months ago
d3cker / comfyui-prompt-generator
ComfyUI custom node for generating prompts from images. Supports Qwen2.5 and Qwen3 (Instruct/Thinking) models, as well as the OpenAI API.
☆26 · Jan 10, 2026 · Updated 6 months ago
IntellectzProductions / Comfy-UI-Workflows
Download workflow here
☆18 · Feb 25, 2026 · Updated 5 months ago
PxTicks / vlo
☆49 · May 26, 2026 · Updated 2 months ago
RudySen / comfyui-muse
Local LLM chat panel for ComfyUI — LM Studio & Ollama, multi-session, vision support, Guide Materials
☆23 · Jun 21, 2026 · Updated last month
Fictiverse / ComfyUI_Fictiverse
ComfyUI Fictiverse custom nodes
☆26 · May 2, 2026 · Updated 2 months ago
Clorr / faceswap
Unofficial project based on the original /r/Deepfakes thread. Many thanks to him!
☆12 · Mar 30, 2018 · Updated 8 years ago