Multiturn VLM Bulk captioning using your api service
☆35Feb 16, 2026Updated 2 weeks ago
Alternatives and similar repositories for vlm-caption
Users that are interested in vlm-caption are comparing it to the libraries listed below
Sorting:
- A realtime speech to text diarization system to gather and interleave speech from multiple speaker audio.☆26Jan 29, 2026Updated last month
- model trained for head detection,graduation project 2023-24☆24Nov 13, 2023Updated 2 years ago
- LiquidTime is a simple yet powerful frame interpolation node for ComfyUI. Just input your sequence and desired frame count - the node han…☆13Apr 3, 2025Updated 11 months ago
- A Powerful LoRA key converter for ComfyUI☆28Nov 17, 2025Updated 3 months ago
- A "loopback on steroids" type of extension for Stable Diffusion Web UI.☆31Oct 10, 2025Updated 4 months ago
- ☆30Oct 4, 2024Updated last year
- ☆68Oct 7, 2025Updated 4 months ago
- ☆52Jun 24, 2025Updated 8 months ago
- Study and research with your docs, media, and AI in one place☆33Updated this week
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…☆12Apr 7, 2025Updated 10 months ago
- ☆32Dec 15, 2023Updated 2 years ago
- Advanced automated image processing tool for selection, cropping, and standardization. (Helper for stable diffusion), now updated with GU…☆35Mar 23, 2025Updated 11 months ago
- ☆13Nov 5, 2024Updated last year
- A comprehensive technical review agent inspired by Bertrand Gilfoyle - providing code quality, security, architecture, and UX analysis wi…☆13Aug 20, 2025Updated 6 months ago
- A UI made in Pyside6 to make training LoRA/LoCon and other LoRA type models in sd-scripts easy☆48Feb 24, 2026Updated last week
- ☆13Nov 21, 2025Updated 3 months ago
- Divide and Conquer Node Suite☆118Nov 17, 2025Updated 3 months ago
- ComfyUI integration for Unreal Engine 5☆48Dec 15, 2025Updated 2 months ago
- API wrapper for uHoo Air☆10Nov 8, 2021Updated 4 years ago
- PainterVRAM lets you reserve a slice of GPU memory before ComfyUI starts processing, preventing out-of-memory crashes. Switch between man…☆27Jan 2, 2026Updated 2 months ago
- ☆19Jan 15, 2026Updated last month
- CRT-Nodes is a collection of custom nodes for ComfyUI.☆95Feb 11, 2026Updated 3 weeks ago
- ☆16May 13, 2021Updated 4 years ago
- MovieLabs Ontology for Media Creation (OMC)☆21Feb 7, 2026Updated 3 weeks ago
- This tool allows you to process multiple images simultaneously, including removing metadata and alpha channels from the images. / 本ツールは、複…☆10Dec 20, 2023Updated 2 years ago
- A powerful tool for automatically generating captions and tags for images using Google's Gemini AI models. #image #caption #tag #photo #r…☆12Mar 16, 2025Updated 11 months ago
- Fully automated Data Forensics / Recovery bot to recover, undelete, index, and organize as much as possible for any hard drive, and optio…☆12Feb 5, 2024Updated 2 years ago
- AI-powered video frame extraction tool that automatically identifies and extracts high-quality frames containing people, with intelligent…☆158Feb 24, 2026Updated last week
- Quick Long Video Understanding [TMLR2025]☆76Oct 27, 2025Updated 4 months ago
- A custom run space to bypass AMSI and Constrained Language mode in PowerShell.☆21May 17, 2023Updated 2 years ago
- Tartocitron is a repo to have fun with malwares and the Rust language. This repo provides working examples of dropper written in Rust.☆11May 31, 2022Updated 3 years ago
- ☆14Jan 10, 2025Updated last year
- image and latent quilting nodes for comfyui☆10Mar 17, 2025Updated 11 months ago
- Modify ELF executables☆16Mar 5, 2019Updated 7 years ago
- AI-Powered RSS Content Filter - Automatically remove ads, sponsored content, and low-quality articles from your FreshRSS feeds using LLM …☆27Nov 1, 2025Updated 4 months ago
- Agent installed on node to launch IDA,Bindiff,... and send results to the server ( AutoDiffWeb )☆10Mar 25, 2016Updated 9 years ago
- ☆15Aug 17, 2023Updated 2 years ago
- ☆13Nov 24, 2021Updated 4 years ago
- ☆11Feb 15, 2023Updated 3 years ago