ide-cap-chan is a utility for batch image captioning with natural language using various VL models
☆14May 1, 2025Updated 11 months ago
Alternatives and similar repositories for ide-cap-chan
Users that are interested in ide-cap-chan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Jan 15, 2025Updated last year
- A simple aesthetic scorer + pruner + website you can run to view the results from the scoring with☆16Jun 3, 2024Updated last year
- comfyui的InternVL2插件,InternVL2是当前不错的开源多模态大语言模型,在文档vqa上表现很好☆13Aug 10, 2024Updated last year
- Swiftly get tons of images from indexed tars on Huggingface☆78Dec 19, 2024Updated last year
- ☆36Mar 20, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A powerful extension for ComfyUI that enables adding notes to any node in your workflow.☆13Apr 20, 2025Updated 11 months ago
- adds a few extra samplers and schedulers to the dropdowns in recent A1111-derived webUIs for Stable Diffusion☆26Dec 5, 2025Updated 4 months ago
- A Demofusion extension for stable-diffusion-webui☆23Apr 21, 2024Updated last year
- A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and…☆48Dec 31, 2024Updated last year
- Modifications to ComfyUI diffusers wrapper node X-Adapter to make it more ComfyUI-node-like☆22Jun 27, 2024Updated last year
- simple diffusers based implementation of Hunyuan-DiT, in Forge webUI for Stable Diffusion. Works with 8GB VRAM.☆17Feb 20, 2025Updated last year
- Stable Diffusion Model Checkpoint Merger☆42Nov 9, 2022Updated 3 years ago
- ☆27Oct 19, 2024Updated last year
- First PuLID implementation for FLUX.2 — Consistent face identity in ComfyUI☆73Mar 28, 2026Updated 2 weeks ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆50Mar 20, 2024Updated 2 years ago
- ☆26Dec 14, 2024Updated last year
- FoleyCrafter is a video-to-audio generation framework which can produce realistic sound effects semantically relevant and synchronized wi…☆66May 29, 2025Updated 10 months ago
- A tool for tagging and preparing images for training text to image models.☆26Updated this week
- AUTOMATIC1111 UI custom script for img2img around face with different "Denoising Strength" settings☆26Jan 11, 2023Updated 3 years ago
- Custom ComfyUI nodes using pytorch360convert☆30Sep 22, 2025Updated 6 months ago
- ComfyUI InstructIR☆78May 22, 2024Updated last year
- Go command line app to exploit file upload vulnerability☆12Feb 8, 2017Updated 9 years ago
- Adds alert blockquote support to VS Code's built-in markdown preview☆13Dec 2, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆76Aug 26, 2025Updated 7 months ago
- ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆20Jun 26, 2025Updated 9 months ago
- An interactive sketching and drawing node for ComfyUI with stylus/pen support – built for fast, intuitive scribbling directly inside your…☆60Aug 9, 2025Updated 8 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆80Jul 19, 2025Updated 8 months ago
- aesthetic for comfy ui☆35Jun 17, 2024Updated last year
- Uses DARE to merge LoRA stacks as a ComfyUI node☆37May 22, 2024Updated last year
- Dataset helper for loras or checkpoints! Download YouTube videos, extract highest-available-quality screenshots, auto filter for aestheti…☆54Apr 8, 2026Updated last week
- Custom nodes for ComfyUI to generate empty latent space compatible with Hunyuan models for both image and video generation.☆10Dec 29, 2024Updated last year
- A clean beamer/ltx-talk theme with a big title graphic☆21Apr 6, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- DeepExtract is a powerful and efficient tool designed to separate vocals and sounds from audio files, providing an enhanced experience fo…☆48Aug 26, 2025Updated 7 months ago
- ☆68Oct 7, 2025Updated 6 months ago
- 《辐射小马国:粉色双眸》的重排版☆12Oct 11, 2019Updated 6 years ago
- Pre-built Python wheels for ComfyUI 3D (pytorch3d, etc) on Linux systems, facilitating easy installation of GPU-accelerated libraries. vi…☆35Feb 13, 2024Updated 2 years ago
- This node is base on VisualCloze method, A Universal Image Generation Framework via Visual In-Context Learning☆11May 21, 2025Updated 10 months ago
- [DEPRECATED] Attempts to convert a Flux lora to a Chroma lora☆20Nov 9, 2025Updated 5 months ago
- An app which uses inpainting to create an infinitely scrolling image☆11Jun 11, 2024Updated last year