A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.
☆40Oct 30, 2025Updated 4 months ago
Alternatives and similar repositories for wd-llm-caption-cli
Users that are interested in wd-llm-caption-cli are comparing it to the libraries listed below
Sorting:
- GUI for the new musubi-tuner☆53Jan 25, 2025Updated last year
- easyanimate generete videos with ExLlamaV2 quantization LLM prompt☆13Jun 26, 2024Updated last year
- a.k.a autoMBW-V2☆10Sep 6, 2024Updated last year
- Bagel but with Gradio Interface☆20May 21, 2025Updated 9 months ago
- GUI for Joy caption beta one☆18Nov 16, 2025Updated 3 months ago
- ☆12Dec 15, 2025Updated 2 months ago
- joy-caption-alpha-two -cli mod and gui mod☆91Apr 27, 2025Updated 10 months ago
- ComfyUI Fictiverse custom nodes☆20Feb 12, 2026Updated 2 weeks ago
- JoyTag is a state of the art AI vision model for tagging images, with a focus on sex positivity and inclusivity. It uses the Danbooru tag…☆65May 22, 2024Updated last year
- A Powerful LoRA key converter for ComfyUI☆28Nov 17, 2025Updated 3 months ago
- stable-diffusion-webui-images-browser☆14Jan 12, 2023Updated 3 years ago
- comfyui的InternVL2插件,InternVL2是当前不错的开源多模态大语言模型,在文档vqa上表现很好☆13Aug 10, 2024Updated last year
- text-image dataset maker for anime-style images☆102Mar 3, 2025Updated last year
- ☆40Jun 30, 2024Updated last year
- Swiftly get tons of images from indexed tars on Huggingface☆77Dec 19, 2024Updated last year
- ☆11Updated this week
- ☆31Feb 12, 2026Updated 2 weeks ago
- ComfyUI-joycaption-beta-one-GGUF Node for ComfyUI☆57Oct 26, 2025Updated 4 months ago
- ComfyUI custom nodes for Ovi joint video+audio generation☆46Oct 6, 2025Updated 4 months ago
- download images and meta data together from civitai☆21Jun 25, 2025Updated 8 months ago
- A collection of custom nodes for ComfyUI.☆28Jun 28, 2025Updated 8 months ago
- Frontend nodes to make comfyui comfier☆39Sep 9, 2025Updated 5 months ago
- Describe a single image or all images in a directory using models such as Janus Pro, Florence2, or JoyCaption (coming soon), with a parti…☆120Oct 9, 2025Updated 4 months ago
- ☆22Sep 4, 2023Updated 2 years ago
- Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)☆53Aug 29, 2024Updated last year
- A SDXL compatible T2I-adapter implementation using Diffusers including a training script☆27Aug 3, 2023Updated 2 years ago
- Modifications to ComfyUI diffusers wrapper node X-Adapter to make it more ComfyUI-node-like☆22Jun 27, 2024Updated last year
- ComfyUI QwenVL and Qwen wrapper☆135Nov 29, 2025Updated 3 months ago
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.☆27Dec 13, 2025Updated 2 months ago
- Virtuoso Nodes - ComfyUI Node Suite☆92Apr 19, 2025Updated 10 months ago
- A SDXL trainer modified from kohya trainer.☆24Dec 3, 2025Updated 3 months ago
- ☆33Apr 13, 2025Updated 10 months ago
- Advanced CLI diffusion inference/training suite based on Musubi Tuner☆40Updated this week
- a.k.a autoMBW-V2☆28Mar 26, 2024Updated last year
- LoRA Explorer model to run with multiple LoRAs using Flux.1[Dev] as the base model☆27Jun 11, 2025Updated 8 months ago
- Artistic style transfer has been part of the quickly growing AI Art community in recent times. Pioneered by Gatys et al this class of met…☆30Mar 14, 2022Updated 3 years ago
- short code snippets of in class examples separated by date.☆19Dec 22, 2020Updated 5 years ago
- Some nodes in future☆23Jul 19, 2024Updated last year
- Powerful video frame manipulation nodes for ComfyUI such as: efficient high quality batch scaling, arbitrary framerate resampling, seamle…☆54Nov 17, 2025Updated 3 months ago