JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
☆1,105 · Feb 24, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for joycaption
Users who are interested in joycaption are comparing it to the libraries listed below.
- ComfyUI Node ☆715 · Jun 18, 2025 · Updated 9 months ago
- JoyCaption ComfyUI Nodes ☆120 · Feb 25, 2026 · Updated 3 weeks ago
- joy-caption-alpha-two -cli mod and gui mod ☆91 · Apr 27, 2025 · Updated 10 months ago
- A batch captioning tool for joy_caption ☆195 · Aug 25, 2025 · Updated 6 months ago
- Tag manager and captioner for image datasets ☆1,280 · Oct 11, 2025 · Updated 5 months ago
- The ultimate training toolkit for finetuning diffusion models ☆9,778 · Mar 10, 2026 · Updated last week
- OneTrainer is a one-stop solution for all your Diffusion training needs. ☆2,859 · Mar 15, 2026 · Updated last week
- Recommended based on comfyui node pictures: Joy_caption + MiniCPMv2_6-prompt-generator + florence2 ☆623 · Feb 6, 2025 · Updated last year
- ☆1,745 · Mar 6, 2026 · Updated 2 weeks ago
- ☆1,359 · Apr 21, 2025 · Updated 11 months ago
- ComfyUI Plugin of Nunchaku ☆2,810 · Feb 19, 2026 · Updated last month
- Official repository of In-Context LoRA for Diffusion Transformers ☆2,061 · Dec 20, 2024 · Updated last year
- Inference Microsoft Florence2 VLM ☆1,635 · Mar 12, 2026 · Updated last week
- A pipeline parallel training script for diffusion models. ☆1,889 · Feb 8, 2026 · Updated last month
- Dead simple FLUX LoRA training UI with LOW VRAM support ☆3,187 · Apr 1, 2025 · Updated 11 months ago
- Nodes for image juxtaposition for Flux in ComfyUI ☆1,397 · Jan 9, 2025 · Updated last year
- GGUF Quantization support for native ComfyUI models ☆3,402 · Jan 12, 2026 · Updated 2 months ago
- A general fine-tuning kit geared toward image/video/audio diffusion models. ☆2,792 · Mar 14, 2026 · Updated last week
- The JoyTag Image Tagging Model ☆555 · May 18, 2024 · Updated last year
- A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail. ☆940 · Dec 21, 2025 · Updated 3 months ago
- ☆6,956 · Updated this week
- Fork of the Triton language and compiler for Windows support and easy installation ☆1,882 · Feb 18, 2026 · Updated last month
- Various training scripts used to train bigasp ☆113 · Aug 13, 2025 · Updated 7 months ago
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models ☆3,739 · Mar 7, 2026 · Updated 2 weeks ago
- ☆1,120 · Apr 2, 2025 · Updated 11 months ago
- ☆515 · Apr 26, 2025 · Updated 10 months ago
- An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and… ☆152 · Mar 14, 2026 · Updated last week
- Nodes for better inpainting with ComfyUI: Fooocus inpaint model for SDXL, LaMa, MAT, and various other tools for pre-filling inpaint & ou… ☆1,154 · Feb 27, 2026 · Updated 3 weeks ago
- https://wavespeed.ai/ [WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast. ☆1,223 · Aug 2, 2025 · Updated 7 months ago
- A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality. ☆2,940 · Jan 30, 2026 · Updated last month
- ComfyUI's ControlNet Auxiliary Preprocessors ☆3,852 · Feb 16, 2026 · Updated last month
- Various custom nodes for ComfyUI ☆2,417 · Updated this week
- ☆1,641 · Jan 13, 2026 · Updated 2 months ago
- ☆1,071 · Jul 12, 2025 · Updated 8 months ago
- Training-free Regional Prompting for Diffusion Transformers 🔥 ☆693 · Nov 28, 2024 · Updated last year
- ☆1,544 · Aug 7, 2025 · Updated 7 months ago
- ☆1,697 · Oct 30, 2024 · Updated last year
- ☆2,586 · Aug 20, 2025 · Updated 7 months ago
- Multimodal captioner ☆220 · Updated this week