Multi-turn VLM bulk captioning using your API service
☆37 · May 2, 2026 · Updated last month
Alternatives and similar repositories for vlm-caption
Users that are interested in vlm-caption are comparing it to the libraries listed below.
- ide-cap-chan is a utility for batch image captioning with natural language using various VL models ☆14 · May 8, 2026 · Updated last month
- A Powerful LoRA key converter for ComfyUI ☆29 · Nov 17, 2025 · Updated 6 months ago
- A realtime speech to text diarization system to gather and interleave speech from multiple speaker audio. ☆51 · Jan 29, 2026 · Updated 4 months ago
- A UI made in Pyside6 to make training LoRA/LoCon and other LoRA type models in sd-scripts easy ☆79 · Jun 2, 2026 · Updated last week
- LiquidTime is a simple yet powerful frame interpolation node for ComfyUI. Just input your sequence and desired frame count - the node han… ☆13 · Apr 3, 2025 · Updated last year
- A tiny model that teaches itself to code better. On your laptop. No cloud. No teacher model. No human feedback. ☆64 · Mar 10, 2026 · Updated 3 months ago
- A "loopback on steroids" type of extension for Stable Diffusion Web UI. ☆31 · Oct 10, 2025 · Updated 8 months ago
- Chain IMG processor with plugins for neuronet pipelines etc. ☆14 · Jun 14, 2023 · Updated 3 years ago
- A Demofusion extension for stable-diffusion-webui ☆23 · Apr 21, 2024 · Updated 2 years ago
- Work on virtio-win/kvm-guest-drivers-windows see https://github.com/virtio-win/kvm-guest-drivers-windows/pull/943 ☆37 · Jun 4, 2026 · Updated last week
- A wrapper to moviepy optimized for bulk extraction of perfect loop gifs from animation ☆12 · Apr 11, 2022 · Updated 4 years ago
- For SDXL, SD1.5, Flux. Nuke T5 and let CLIP guide Flux.1 on its own! Or let random guide Flux.1! Or load a CLIP crazy opinion embeddi… ☆25 · Aug 5, 2025 · Updated 10 months ago
- Custom ComfyUI node that combines VSR + VFI and allows streaming processing for arbitrary video length. ☆66 · Mar 28, 2026 · Updated 2 months ago
- Jannchie's ComfyUI custom nodes. ☆98 · Apr 7, 2025 · Updated last year
- Dependency used by Modders for SPT to import quest files and images into the game. ☆10 · Mar 16, 2025 · Updated last year
- ☆19 · Dec 8, 2024 · Updated last year
- ☆54 · Jun 24, 2025 · Updated 11 months ago
- ☆13 · Jul 28, 2024 · Updated last year
- Scalable DBSCAN and OPTICS for clustering high-dimensional datasets using random projections ☆14 · Nov 1, 2024 · Updated last year
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation ☆11 · Jul 31, 2024 · Updated last year
- PainterVRAM lets you reserve a slice of GPU memory before ComfyUI starts processing, preventing out-of-memory crashes. Switch between man… ☆37 · Jan 2, 2026 · Updated 5 months ago
- 🍎 Apple Code Assistant - Professional CLI tool powered by Apple Intelligence for on-device code generation. Features modern terminal UI,… ☆33 · Dec 16, 2025 · Updated 5 months ago
- image and latent quilting nodes for comfyui ☆10 · Mar 17, 2025 · Updated last year
- TensorFlow implementation of FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors ☆15 · Jul 11, 2019 · Updated 6 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers ☆11 · Jul 15, 2024 · Updated last year
- A powerful tool for automatically generating captions and tags for images using Google's Gemini AI models. #image #caption #tag #photo #r… ☆11 · Mar 16, 2025 · Updated last year
- ☆69 · Oct 7, 2025 · Updated 8 months ago
- Cleans prompts from duplicates and other garbage. Constructs prompt using various nodes ☆16 · Jul 16, 2025 · Updated 10 months ago
- ComfyUI Node for FlashFace ☆69 · Feb 27, 2025 · Updated last year
- ☆32 · Dec 15, 2023 · Updated 2 years ago
- 🔮 A powerful and stylish Prompt Generator powered by OpenAI and Python. Includes a built-in JSON editor, modular prompt libraries, and f… ☆20 · Jul 12, 2025 · Updated 11 months ago
- Multi-faceted Video Moment Localizer ☆17 · Jun 19, 2020 · Updated 5 years ago
- A tool for tagging and preparing images for training text to image models. ☆26 · May 16, 2026 · Updated 3 weeks ago
- Custom node for total control over resolution and aspect ratio. It provides an intuitive interface with an interactive canvas, advanced s… ☆273 · May 26, 2026 · Updated 2 weeks ago
- Code Release for ECCV 2024, "PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion" ☆21 · Mar 23, 2025 · Updated last year
- Face super resolution ☆10 · Aug 28, 2020 · Updated 5 years ago
- Simple LaMa Inpainting: An easy-to-use implementation of the LaMa (Large Mask) inpainting model. Remove unwanted objects or fill in missi… ☆25 · Nov 5, 2024 · Updated last year
- CRT-Nodes is a collection of custom nodes for ComfyUI. ☆116 · May 30, 2026 · Updated 2 weeks ago
- ☆43 · Sep 30, 2025 · Updated 8 months ago