Batch video captioning using Qwen3-VL-8B vision-language model
☆72Mar 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for Video-Caption-Suite
Users that are interested in Video-Caption-Suite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆62Dec 22, 2025Updated 3 months ago
- Transform ComfyUI into a Universal AI Vibe Coding Agent — From Code to Productivity Automation☆33Jun 5, 2025Updated 9 months ago
- A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and…☆47Dec 31, 2024Updated last year
- This is a ComfyUI custom node used to convert Qwen-Image LoRA files trained on the ModelScope platform to a format that ComfyUI can recog…☆29Aug 9, 2025Updated 7 months ago
- Python GUI tool for preparing video datasets (LORA, Wan, Hunyuan training). Features range clipping, cropping, FPS conversion & optional …☆96Jun 2, 2025Updated 9 months ago
- ComfyUI wrapper for Motion capture from video☆217Mar 4, 2026Updated 2 weeks ago
- FL HeartMuLa - Multilingual AI music generation nodes for ComfyUI. Generate full songs with lyrics using HeartMuLa.☆119Jan 24, 2026Updated last month
- Upscale, enhance, and reimagine your renders with a single prompt using Stable Diffusion and FLUX.☆14Aug 26, 2024Updated last year
- The official repo of VideoAgentTrek☆46Oct 24, 2025Updated 5 months ago
- Multipurpose lens post process effects node for ComfyUI. Realistic or stylistic lens distortions, chromatic aberration, post-process scal…☆22Jul 10, 2025Updated 8 months ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago
- Nodes to run Hunyuan Image 3 locally with BF16 and NF4 quantized options in Comfyui☆42Feb 21, 2026Updated last month
- ComfyUI integration for Unreal Engine 5☆50Dec 15, 2025Updated 3 months ago
- ComfyUI node for AudioSR - Versatile Audio Super Resolution upscales audio to 48kHz using latent diffusion☆72Feb 12, 2026Updated last month
- CLI AI assistant doing your code reviews☆12Updated this week
- Qwen3-TTS text-to-speech nodes for ComfyUI with voice cloning, voice design, and fine-tuning UI☆117Mar 1, 2026Updated 3 weeks ago
- Nodes for high resolution outputs and high frame numbers using LTX-2 in ComfyUI☆135Jan 27, 2026Updated last month
- An upscaler node for flow-matching models like Qwen, applying the DemoFusion approach☆58Jan 29, 2026Updated last month
- ☆19Jul 31, 2024Updated last year
- Image Annotations tools integrated with sam 3☆23Feb 6, 2026Updated last month
- ☆27Jun 30, 2025Updated 8 months ago
- The AI Code Cartographer: A Prompt for Self-Generating Knowledge Graphs☆29Jan 4, 2026Updated 2 months ago
- mcp server for AI to understand osrs☆19Jul 29, 2025Updated 7 months ago
- Repository for Screen2AX paper☆22Mar 12, 2026Updated last week
- SketchColour receives colored first frame and entire scene in sketch format, then colors each frame based on the reference. Evaluated on …☆31Jul 9, 2025Updated 8 months ago
- CrossOS automated setup for AI-development or day-to-day environments in Windows, Linux, and macOS.☆27Feb 22, 2026Updated last month
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆114Dec 11, 2025Updated 3 months ago
- The official implementation of the Paper: "StyleSculptor: Zero-Shot Style-Controllable 3D Asset Generation with Texture-Geometry Dual Gui…☆44Oct 17, 2025Updated 5 months ago
- Stable Diffusion for studies☆14Mar 12, 2023Updated 3 years ago
- ☆23Feb 21, 2025Updated last year
- A tokenbased 3d rendering engine for ComfyUI.☆35Aug 8, 2025Updated 7 months ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗☆13Sep 24, 2024Updated last year
- UE5 MediaPipe free plugin motion capture and facial☆13Feb 25, 2023Updated 3 years ago
- ☁️ Synchronise SoundCloud User Likes With a Local Folder☆16Updated this week
- A collection of various custom nodes for ComfyUI (Work in progress)☆14Jun 9, 2025Updated 9 months ago
- ComfyUI-AniSora is now available in ComfyUI, Index-AniSora is the most powerful open-source animated video generation model. It enables o…☆50May 27, 2025Updated 9 months ago
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- FIGR-8, but images in .SVG vector graphics format☆15Feb 16, 2019Updated 7 years ago
- Run AuraFlow on Replicate☆14Jul 12, 2024Updated last year