42lux / CaptainCaptionLinks

A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.

☆60

Alternatives and similar repositories for CaptainCaption

Users that are interested in CaptainCaption are comparing it to the libraries listed below

Sorting:

comfy-deploy / comfyui-deploy-gradio-demo
Gradio Demo for ComfyDeploy
☆54Updated last year
neph1 / finetrainers-ui
Gradio UI for training video models using finetrainers
☆30Updated 4 months ago
fofr / cog-flux-layers-explorer
Explore how Flux Dev responds when you change the strengths of layers in the model.
☆20Updated 11 months ago
EnVision-Research / DisEnvisioner
Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation
☆119Updated 7 months ago
camenduru / stable-audio-jupyter
☆13Updated last year
OutofAi / OutofFocus
An AI focused photo manipulation tool based on Gradio
☆186Updated last month
camenduru / TokenFlow-colab
☆22Updated last year
camenduru / mimic-motion-tost
☆23Updated 10 months ago
logtd / ComfyUI-Veevee
Video2Video Framework for ComfyUI
☆63Updated last year
deepbeepmeep / Cosmos1GP
Cosmos1GP for the GPU Poor by DeepBeepMeep
☆74Updated 6 months ago
DEVAIEXP / mod-control-tile-upscaler-sdxl
MoD Control Tile Upscaler for SDXL Pipeline
☆60Updated 5 months ago
camenduru / VisualStylePrompting-jupyter
☆13Updated last year
camenduru / PuLID-jupyter
☆32Updated 9 months ago
martintomov / comfy-anything
Community ComfyUI workflows running on fal.ai
☆58Updated 11 months ago
camenduru / MusePose-jupyter
☆18Updated last year
camenduru / InstantID-jupyter
☆20Updated last year
zer0int / Long-CLIP
Scripts for use with LongCLIP, including fine-tuning Long-CLIP
☆62Updated 5 months ago
Anashel-RPG / anashel-utils
Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1
☆81Updated 11 months ago
woct0rdho / ComfyUI-RadialAttn
RadialAttention in ComfyUI native workflow
☆52Updated 2 weeks ago
camenduru / FreeInit-colab
☆25Updated last year
KunpengSong / MoMA-inactive
[inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
☆13Updated last year
Dango233 / ComfyUI-HunyuanVideoWrapper-IP2V
☆44Updated 8 months ago
camenduru / CCSR-colab
☆17Updated last year
songrise / Artist
Official repo for DiffArtist (ACM MM 2025)
☆122Updated last month
zsxkib / cog-comfyui-hunyuan-video
☆17Updated 7 months ago
camenduru / sliders-colab
☆32Updated last year
ShmuelRonen / ComfyUI-HunyuanVideoStyler
A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and…
☆44Updated 7 months ago
Nojahhh / cogvideox-loras
CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With …
☆81Updated 8 months ago
Binxly / sd3-training
sd3 dreambooth lora training book, adapted from the diffusers doc
☆45Updated last year
camenduru / text-behind-tost
☆57Updated 9 months ago