Batch video captioning using Qwen3-VL-8B vision-language model
☆79 · Apr 19, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for Video-Caption-Suite
Users who are interested in Video-Caption-Suite are comparing it to the libraries listed below.
- ☆64 · Dec 22, 2025 · Updated 4 months ago
- ☆19 · Jan 17, 2025 · Updated last year
- Transform ComfyUI into a Universal AI Vibe Coding Agent: From Code to Productivity Automation ☆34 · Jun 5, 2025 · Updated 11 months ago
- A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and… ☆48 · Dec 31, 2024 · Updated last year
- This is a ComfyUI custom node used to convert Qwen-Image LoRA files trained on the ModelScope platform to a format that ComfyUI can recog… ☆30 · Aug 9, 2025 · Updated 8 months ago
- Python GUI tool for preparing video datasets (LORA, Wan, Hunyuan training). Features range clipping, cropping, FPS conversion & optional … ☆98 · Jun 2, 2025 · Updated 11 months ago
- ComfyUI wrapper for Motion capture from video ☆241 · Updated this week
- FL HeartMuLa - Multilingual AI music generation nodes for ComfyUI. Generate full songs with lyrics using HeartMuLa. ☆124 · Apr 25, 2026 · Updated last week
- Upscale, enhance, and reimagine your renders with a single prompt using Stable Diffusion and FLUX. ☆14 · Aug 26, 2024 · Updated last year
- SillyTavern fork with personal customizations. ☆13 · Apr 9, 2025 · Updated last year
- The official repo of VideoAgentTrek ☆48 · Oct 24, 2025 · Updated 6 months ago
- Multipurpose lens post process effects node for ComfyUI. Realistic or stylistic lens distortions, chromatic aberration, post-process scal… ☆22 · Jul 10, 2025 · Updated 9 months ago
- A simple PyQt5 UI for editing TavernAI character cards ☆19 · Apr 20, 2024 · Updated 2 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i… ☆24 · Nov 29, 2025 · Updated 5 months ago
- AGX Dynamics for Unreal plugin. ☆12 · Updated this week
- ComfyUI integration for Unreal Engine 5 ☆54 · Dec 15, 2025 · Updated 4 months ago
- ComfyUI node for AudioSR - Versatile Audio Super Resolution upscales audio to 48kHz using latent diffusion ☆77 · Feb 12, 2026 · Updated 2 months ago
- CLI AI assistant doing your code reviews ☆12 · Updated this week
- Qwen3-TTS text-to-speech nodes for ComfyUI with voice cloning, voice design, and fine-tuning UI ☆125 · Apr 25, 2026 · Updated last week
- Nodes for high resolution outputs and high frame numbers using LTX-2 in ComfyUI ☆140 · Jan 27, 2026 · Updated 3 months ago
- AUTOMATIC1111 port of Bubble Prompter by pols on HF ☆19 · Nov 12, 2024 · Updated last year
- [CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation ☆23 · May 24, 2025 · Updated 11 months ago
- An upscaler node for flow-matching models like Qwen, applying the DemoFusion approach ☆60 · Jan 29, 2026 · Updated 3 months ago
- soon ☆10 · Updated this week
- ☆28 · Jun 30, 2025 · Updated 10 months ago
- Image Annotations tools integrated with sam 3 ☆25 · Feb 6, 2026 · Updated 2 months ago
- The AI Code Cartographer: A Prompt for Self-Generating Knowledge Graphs ☆29 · Jan 4, 2026 · Updated 4 months ago
- mcp server for AI to understand osrs ☆20 · Jul 29, 2025 · Updated 9 months ago
- SketchColour receives colored first frame and entire scene in sketch format, then colors each frame based on the reference. Evaluated on … ☆32 · Jul 9, 2025 · Updated 9 months ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation ☆117 · Dec 11, 2025 · Updated 4 months ago
- CrossOS automated setup for AI-development or day-to-day environments in Windows, Linux, and macOS. ☆28 · Mar 27, 2026 · Updated last month
- ☆19 · Jul 31, 2024 · Updated last year
- ☆61 · Apr 26, 2026 · Updated last week
- The official implementation of the Paper: "StyleSculptor: Zero-Shot Style-Controllable 3D Asset Generation with Texture-Geometry Dual Gui… ☆45 · Oct 17, 2025 · Updated 6 months ago
- Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR + Qwen3-VL. Optimized for Apple Sil… ☆32 · Dec 4, 2025 · Updated 5 months ago
- ☆93 · Mar 5, 2026 · Updated last month
- ☆23 · Feb 21, 2025 · Updated last year
- An example repo for how to use StartupOS ☆12 · Aug 26, 2018 · Updated 7 years ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗 ☆13 · Sep 24, 2024 · Updated last year