ide-cap-chan is a utility for batch image captioning with natural language using various VL models
☆14May 1, 2025Updated 10 months ago
Alternatives and similar repositories for ide-cap-chan
Users that are interested in ide-cap-chan are comparing it to the libraries listed below
Sorting:
- A simple aesthetic scorer + pruner + website you can run to view the results from the scoring with☆15Jun 3, 2024Updated last year
- simple diffusers based implementation of Hunyuan-DiT, in Forge webUI for Stable Diffusion. Works with 8GB VRAM.☆17Feb 20, 2025Updated last year
- ☆21Jan 15, 2025Updated last year
- comfyui的InternVL2插件,InternVL2是当前不错的开源多模态大语言模型,在文档vqa上表现很好☆13Aug 10, 2024Updated last year
- Custom LORA training on DynamiCrafter☆18Jul 26, 2024Updated last year
- Custom ComfyUI nodes using pytorch360convert☆27Sep 22, 2025Updated 5 months ago
- A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and…☆46Dec 31, 2024Updated last year
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆25May 16, 2025Updated 9 months ago
- ☆74Aug 26, 2025Updated 6 months ago
- Modifications to ComfyUI diffusers wrapper node X-Adapter to make it more ComfyUI-node-like☆22Jun 27, 2024Updated last year
- ☆27Oct 19, 2024Updated last year
- ☆26Dec 14, 2024Updated last year
- AUTOMATIC1111 UI custom script for img2img around face with different "Denoising Strength" settings☆26Jan 11, 2023Updated 3 years ago
- FoleyCrafter is a video-to-audio generation framework which can produce realistic sound effects semantically relevant and synchronized wi…☆66May 29, 2025Updated 9 months ago
- ☆68Oct 7, 2025Updated 5 months ago
- extension for Forge2 webui for Stable Diffusion; adds support for multi-prompting for Flux, SD3, sdXL; Shift for Flux, Sd3; override pred…☆37Nov 30, 2025Updated 3 months ago
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…☆12Apr 7, 2025Updated 11 months ago
- Uses DARE to merge LoRA stacks as a ComfyUI node☆37May 22, 2024Updated last year
- Swiftly get tons of images from indexed tars on Huggingface☆77Dec 19, 2024Updated last year
- ComfyUI InstructIR☆77May 22, 2024Updated last year
- Dataset helper for loras or checkpoints! Download YouTube videos, extract highest-available-quality screenshots, auto filter for aestheti…☆52Jan 7, 2026Updated 2 months ago
- aesthetic for comfy ui☆34Jun 17, 2024Updated last year
- ☆36Mar 20, 2024Updated last year
- Advanced automated image processing tool for selection, cropping, and standardization. (Helper for stable diffusion), now updated with GU…☆35Mar 23, 2025Updated 11 months ago
- ☆18Feb 21, 2026Updated 2 weeks ago
- MQTT interface for Bluetti power stations☆16Jun 21, 2025Updated 8 months ago
- An app which uses inpainting to create an infinitely scrolling image☆11Jun 11, 2024Updated last year
- ☆17Feb 4, 2026Updated last month
- ComfyUI integration for Unreal Engine 5☆49Dec 15, 2025Updated 2 months ago
- DeepExtract is a powerful and efficient tool designed to separate vocals and sounds from audio files, providing an enhanced experience fo…☆45Aug 26, 2025Updated 6 months ago
- This tool allows you to process multiple images simultaneously, including removing metadata and alpha channels from the images. / 本ツールは、複…☆10Dec 20, 2023Updated 2 years ago
- API wrapper for uHoo Air☆10Nov 8, 2021Updated 4 years ago
- Quick hack job to allow use with Sillytavern. This works for me, some further updates are expected to expose more settings to sillytavern☆11May 30, 2024Updated last year
- Custom nodes for ComfyUI to generate empty latent space compatible with Hunyuan models for both image and video generation.☆10Dec 29, 2024Updated last year
- Pre-built Python wheels for ComfyUI 3D (pytorch3d, etc) on Linux systems, facilitating easy installation of GPU-accelerated libraries. vi…☆36Feb 13, 2024Updated 2 years ago
- CRT-Nodes is a collection of custom nodes for ComfyUI.☆95Feb 11, 2026Updated 3 weeks ago
- Fast LLM swapping with sleep/wake support, compatible with vllm, llama.cpp, etc. llama-swap fork.☆30Feb 14, 2026Updated 3 weeks ago
- A clean beamer/ltx-talk theme with a big title graphic☆20Feb 16, 2026Updated 2 weeks ago
- ☆13Apr 13, 2024Updated last year