Fill in the key and URL to quickly call GPT4V to annotate images
☆27Apr 6, 2025Updated 11 months ago
Alternatives and similar repositories for ComfyUI-GPT4V-Image-Captioner
Users that are interested in ComfyUI-GPT4V-Image-Captioner are comparing it to the libraries listed below
- Simple ComfyUI node based on the BLIP method, providing image-to-text conversion☆10Dec 6, 2024Updated last year
- A simple set of nodes for making an image fit within a bounding box in comfyui☆52May 22, 2024Updated last year
- Simple wrapper to try out ELLA in ComfyUI using diffusers☆113May 21, 2024Updated last year
- Eagle Plugin☆21Aug 27, 2024Updated last year
- ☆24Jun 14, 2024Updated last year
- ☆11Jul 29, 2024Updated last year
- ☆12Dec 20, 2025Updated 2 months ago
- Custom node for ComfyUI/Stable Diffusion☆194Jul 10, 2025Updated 7 months ago
- Simple DeepSeek-VL inference in ComfyUI☆50May 21, 2024Updated last year
- a node for AuraSR☆22Jun 27, 2024Updated last year
- ☆15Jun 14, 2024Updated last year
- A small collection of custom nodes for use with ComfyUI, for geometry calculations☆12Sep 30, 2024Updated last year
- ☆12May 23, 2024Updated last year
- A ComfyUI custom node by BillBum for using API generation (VLM LLM T2I API Tools)☆10Feb 4, 2026Updated last month
- A simple download tool for using pipeline in comfyUI☆10Aug 5, 2024Updated last year
- ☆30Aug 20, 2024Updated last year
- Unofficial implementation of PixArt-alpha-Diffusers for ComfyUI☆51May 22, 2024Updated last year
- Llama3_8B for comfyUI, using pipeline workflow☆27Jun 25, 2024Updated last year
- You can call ChatGLM's API in ComfyUI to translate and describe pictures☆25Jul 31, 2024Updated last year
- Nodes: Wildcard Processor, Get File Path, Save Text File, Download Image from URL, Tiktoken Tokenizer, String Cleaning, String Text Split…☆95Feb 27, 2026Updated last week
- ☆13Nov 27, 2023Updated 2 years ago
- ☆13May 23, 2024Updated last year
- ☆35Jan 14, 2026Updated last month
- A ComfyUI plugin that uses Microsoft Speech TTS to convert text to an MP3 file; also includes sound playback and matching trigger nodes☆29Mar 31, 2025Updated 11 months ago
- JoyTag is a state of the art AI vision model for tagging images, with a focus on sex positivity and inclusivity. It uses the Danbooru tag…☆66May 22, 2024Updated last year
- ☆32May 22, 2024Updated last year
- InternVL2 plugin for ComfyUI; InternVL2 is currently a strong open-source multimodal large language model that performs well on document VQA☆13Aug 10, 2024Updated last year
- A PyQt GUI for ESRGAN☆14May 25, 2022Updated 3 years ago
- Help everyone quickly call RH's API on coze to implement various wonderful AI applications☆16Jan 15, 2025Updated last year
- A collection of custom nodes and workflows for ComfyUI maintained by https://eden.art/☆108Updated this week
- Front end ComfyUI nodes for CartoonSegmentation☆17May 22, 2024Updated last year
- set of tools☆18Dec 7, 2025Updated 3 months ago
- Nodes for ComfyUI to simplify workflows☆72Mar 2, 2026Updated last week
- ☆853Jan 17, 2025Updated last year
- Fine-tune your Florence-2 model easily☆19Jul 8, 2024Updated last year
- ComfyUI Implementation of ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆161Apr 27, 2024Updated last year
- MuseTalk audio driven face inpainting☆69May 21, 2024Updated last year
- Unofficial implementation of DepthFM for ComfyUI☆75May 22, 2024Updated last year
- Unofficial implementation of MiniCPM-V and MiniCPM-V-2 in ComfyUI☆42Aug 9, 2024Updated last year