Automated image & video captioning using Qwen-VL, Gemma4 and SAM3.
☆74Apr 27, 2026Updated last week
Alternatives and similar repositories for VisionCaptioner
Users that are interested in VisionCaptioner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A software to automatically tag images. It's primary use is for training Stable Diffusion checkpoints and loras.☆24Dec 4, 2025Updated 5 months ago
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated 3 months ago
- Doom for Gear VR☆19Jun 4, 2019Updated 6 years ago
- Self-hosted personal AI agent and employee for workflow automation in your DMs. It writes code, runs tools, schedules jobs, saves workflo…☆33Updated this week
- ☆14Jun 25, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Mocap Project☆20Dec 13, 2012Updated 13 years ago
- Inference of MiniCPM-o 2.6 in plain C/C++☆34Oct 14, 2025Updated 6 months ago
- Control Virt-A-Mate (NSFW) commands with the keyboard or gamepads☆14Sep 21, 2020Updated 5 years ago
- Oculus Rift support for Game Maker☆16Jun 16, 2018Updated 7 years ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆11Apr 12, 2024Updated 2 years ago
- [CVPR 2026 Highlight] XL-VLA: Cross-Hand Latent Representation for Vision-Language-Action Models☆83Apr 15, 2026Updated 2 weeks ago
- Pathtracing☆14Jan 7, 2025Updated last year
- BiRefNet background removal for ComfyUI☆16Dec 19, 2025Updated 4 months ago
- Wiki☆11Mar 11, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Unmaintained - .NET VRPN server and framework for Kinect voice recognition and skeletal tracking☆14Feb 6, 2019Updated 7 years ago
- A ComfyUI plugin that integrates [Pascal Editor](https://github.com/pascalorg/editor) — a full-featured 3D architectural editor — directl…☆83Apr 24, 2026Updated last week
- Ultimate Persona is an all-in-one persona generator and plot hook creator for SillyTavern. It uses pre-existing character cards to shape …☆38Dec 30, 2025Updated 4 months ago
- ☆15Nov 21, 2024Updated last year
- Network-based addon for interfacing with FaceShift Studio from openFrameworks.☆40Oct 12, 2012Updated 13 years ago
- The implementation of 'M3Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection'.☆12Apr 18, 2025Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences☆16Jun 14, 2024Updated last year
- Control LLM☆23Apr 6, 2025Updated last year
- This repository collects recent NAS based methods and provide a summary (Paper and Code) by year and task. We hope this repo can help yo…☆14Oct 6, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A list of camouflaged object detection papers, codes and datasets.☆14Sep 8, 2023Updated 2 years ago
- Kinect v2 (XBox One) plugin for Cycling '74 Max☆21Feb 10, 2023Updated 3 years ago
- ☆18Jan 21, 2025Updated last year
- Code for MM-DINOv2: Adapting Foundation Models for Multi-Modal Medical Image Analysis (MICCAI2025)☆25Oct 27, 2025Updated 6 months ago
- An advanced Jupyter Notebook for creating precise datasets tailored to stable Diffusion LoRa training. Automate face detection, similarit…☆46Nov 29, 2024Updated last year
- Multi-GPU device selection for LTXV2 video generation in ComfyUI☆29Jan 10, 2026Updated 3 months ago
- SteamVR support for the Razer Hydra, and possibly other OSVR supported controllers.☆10Apr 19, 2016Updated 10 years ago
- ☆19Jan 30, 2023Updated 3 years ago
- HTML Agility Pack implementation in Powershell for parsing and manipulating HTML☆58Jul 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Buttplug.io Integration for Home Assistant☆30Dec 10, 2025Updated 4 months ago
- FlowPath is a free and open source ComfyUI custom node that intelligently organizes your AI-generated images.☆38Feb 5, 2026Updated 3 months ago
- Karaoke lyrics plugins for Flutter☆10Apr 30, 2022Updated 4 years ago
- Official implementation of "Can You Spot the Chameleon? Adversarially Camouflaging Images from Co-Salient Object Detection" in CVPR 2022.☆19Mar 28, 2022Updated 4 years ago
- OpenCap Monocular: 3D Human Kinematics and Dynamics From a Single Smartphone Video☆142Apr 15, 2026Updated 2 weeks ago
- IME for Mac.☆10Apr 19, 2026Updated 2 weeks ago
- A repository of Python & PyTorch scripts which (currently) converts .safetensors models into scaled FP8 variants, utilizing gradient desc…☆27Aug 8, 2025Updated 8 months ago