ide-cap-chan is a utility for batch image captioning with natural language using various VL models
☆14May 8, 2026Updated last month
Alternatives and similar repositories for ide-cap-chan
Users that are interested in ide-cap-chan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Jan 15, 2025Updated last year
- Multiturn VLM Bulk captioning using your api service☆38May 2, 2026Updated last month
- A simple aesthetic scorer + pruner + website you can run to view the results from the scoring with☆16Jun 3, 2024Updated 2 years ago
- comfyui的InternVL2插件,InternVL2是当前不错的开源多模态大语言模型,在文档vqa上表现很好☆13Aug 10, 2024Updated last year
- Swiftly get tons of images from indexed tars on Huggingface☆80Dec 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆37Mar 20, 2024Updated 2 years ago
- A powerful extension for ComfyUI that enables adding notes to any node in your workflow.☆16Apr 20, 2025Updated last year
- adds a few extra samplers and schedulers to the dropdowns in recent A1111-derived webUIs for Stable Diffusion☆25Dec 5, 2025Updated 6 months ago
- A Demofusion extension for stable-diffusion-webui☆23Apr 21, 2024Updated 2 years ago
- A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and…☆48Dec 31, 2024Updated last year
- Custom LORA training on DynamiCrafter☆18Jul 26, 2024Updated last year
- Modifications to ComfyUI diffusers wrapper node X-Adapter to make it more ComfyUI-node-like☆22Jun 27, 2024Updated last year
- simple diffusers based implementation of Hunyuan-DiT, in Forge webUI for Stable Diffusion. Works with 8GB VRAM.☆17Feb 20, 2025Updated last year
- Stable Diffusion Model Checkpoint Merger☆42Nov 9, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆27Oct 19, 2024Updated last year
- Implementation of FlashAttention-2 for Nvidia Tesla V100☆152Updated this week
- A UI made in Pyside6 to make training LoRA/LoCon and other LoRA type models in sd-scripts easy☆79Jun 2, 2026Updated 2 weeks ago
- ComfyUI GlitchNodes☆70Jun 10, 2026Updated last week
- ☆50Mar 20, 2024Updated 2 years ago
- ☆25Dec 14, 2024Updated last year
- First PuLID implementation for FLUX.2 — Consistent face identity in ComfyUI☆105May 21, 2026Updated 3 weeks ago
- FoleyCrafter is a video-to-audio generation framework which can produce realistic sound effects semantically relevant and synchronized wi…☆67May 29, 2025Updated last year
- The Best F-Chat 3.0 Client, No exceptions!☆46Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- AUTOMATIC1111 UI custom script for img2img around face with different "Denoising Strength" settings☆25Jan 11, 2023Updated 3 years ago
- ☆57Oct 10, 2025Updated 8 months ago
- A tool for tagging and preparing images for training text to image models.☆26May 16, 2026Updated last month
- ComfyUI InstructIR☆78May 22, 2024Updated 2 years ago
- Go command line app to exploit file upload vulnerability☆12Feb 8, 2017Updated 9 years ago
- Adds alert blockquote support to VS Code's built-in markdown preview☆13Dec 2, 2023Updated 2 years ago
- ☆78Aug 26, 2025Updated 9 months ago
- ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆22Jun 26, 2025Updated 11 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆81Jul 19, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An interactive sketching and drawing node for ComfyUI with stylus/pen support – built for fast, intuitive scribbling directly inside your…☆64Aug 9, 2025Updated 10 months ago
- aesthetic for comfy ui☆35Jun 17, 2024Updated 2 years ago
- Uses DARE to merge LoRA stacks as a ComfyUI node☆38May 22, 2024Updated 2 years ago
- Custom ComfyUI nodes using pytorch360convert☆37Sep 22, 2025Updated 8 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆25May 16, 2025Updated last year
- Custom nodes for ComfyUI to generate empty latent space compatible with Hunyuan models for both image and video generation.☆10Dec 29, 2024Updated last year
- A clean beamer/ltx-talk theme with a big title graphic☆21Jun 11, 2026Updated last week