Automated image & video captioning using Qwen-VL, Gemma4 and SAM3.
☆81Apr 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for VisionCaptioner
Users that are interested in VisionCaptioner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DaVinci Resolve Studio 20.3 for Windows – professional video editing, color grading, VFX, audio post-production, 4K/8K support, AI tools,…☆38Feb 28, 2026Updated 2 months ago
- A sample Nodejs application to demonstrate Inutit Payment API's ( Support for both OAuth1.0 and OAuth2.0 )☆10Jul 30, 2021Updated 4 years ago
- Fast and flexible keyboard layout analyzer and optimizer☆18Dec 10, 2025Updated 5 months ago
- Removes the Zone.Identifier alternate data stream that identifies files as 'from the internet'☆20Jan 22, 2026Updated 4 months ago
- An extension to use Kokoro TTS in text generation webui☆22May 5, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An AI powered image editing tool for Linux desktops to enhance lighting on people in photographs.☆59Mar 11, 2026Updated 2 months ago
- Uses SAM3 to transfer 2D semantics onto 3D point clouds, producing segmented training data and efficient scene graphs for indoor environm…☆83Jan 23, 2026Updated 4 months ago
- A software to automatically tag images. It's primary use is for training Stable Diffusion checkpoints and loras.☆24Dec 4, 2025Updated 5 months ago
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated 4 months ago
- Doom for Gear VR☆19Jun 4, 2019Updated 6 years ago
- [NeurIPS 2025] UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting☆103Apr 8, 2026Updated last month
- android_device_moto_wingray☆11May 11, 2016Updated 10 years ago
- Deploy your own private OpenAI-compatible LLM☆27Jun 5, 2025Updated 11 months ago
- A practical collection of plug-and-play starter blueprints for building autonomous AI apps with real APIs. Discover ready-to-build agent …☆142Apr 7, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Open-source WhatsApp bot with AI capabilities - Logan by YUV.AI☆73Feb 13, 2026Updated 3 months ago
- Theme for TextMate☆14Oct 5, 2017Updated 8 years ago
- Self-hosted personal AI agent and employee for workflow automation in your DMs. It writes code, runs tools, schedules jobs, saves workflo…☆35May 18, 2026Updated last week
- A repo for the Pipecat + Gemini Workshop at the AI Engineer World's Fair☆37Jun 3, 2025Updated 11 months ago
- The official implementation of “MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction”☆63Mar 20, 2026Updated 2 months ago
- Morphological analyzer / inflection engine for Russian and Ukrainian languages. Fork of https://github.com/pymorphy2/pymorphy2☆11Dec 1, 2025Updated 5 months ago
- An Open Source Machine Learning Framework for Everyone☆11Jun 2, 2019Updated 6 years ago
- A Windows executable to generate MilkVR ".mvrl" files for a collection of videos on your local PC, allowing easy access to those videos f…☆12Aug 24, 2017Updated 8 years ago
- ☆14Jun 25, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NucleusImage training recipe☆80Apr 9, 2026Updated last month
- ComfyUI custom node for generating prompts from images. Supports Qwen2.5 and Qwen3 (Instruct/Thinking) models, as well as the OpenAI API.☆24Jan 10, 2026Updated 4 months ago
- Patch for EU4, HoI4 and Stellaris to work on WinXP☆18Jan 7, 2019Updated 7 years ago
- Small, regex-based URL parsing function for use in in browser-based JS☆27Mar 5, 2018Updated 8 years ago
- BOF for Havoc that copies locked Windows files (SAM, SYSTEM, NTDS.dit) via raw MFT parsing — no VSS, no Registry APIs, no PowerShell☆128Apr 6, 2026Updated last month
- A native Application , With Agentic Support (MCP) for ultra fast AI image generation using a highly optimized Z-Image-Turbo model with SD…☆31Feb 3, 2026Updated 3 months ago
- AllJoyn Device Service Bridge for Philips Hue☆11Jan 20, 2016Updated 10 years ago
- Mocap Project☆20Dec 13, 2012Updated 13 years ago
- ☆16Nov 10, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Jul 18, 2024Updated last year
- Windows binaries for sfarkxtc (to uncompress soundfonts)☆10Sep 14, 2025Updated 8 months ago
- xargs for Windows (wargs), iconv for Windows (wconv), Text conversion library☆22Sep 7, 2024Updated last year
- Webui for editing/managing LoRA datasets☆29Mar 14, 2024Updated 2 years ago
- A tiny, fully accessible tab switcher for jQuery.☆47May 23, 2017Updated 9 years ago
- ☆12Jul 30, 2019Updated 6 years ago
- Control Virt-A-Mate (NSFW) commands with the keyboard or gamepads☆14Sep 21, 2020Updated 5 years ago