MTVCraft: An Open Veo3-style Audio-Video Generation Demo
☆98Oct 8, 2025Updated 7 months ago
Alternatives and similar repositories for MTVCraft
Users that are interested in MTVCraft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation and project page of the CVPR'24 paper "VMINer: Versatile Multi-view Inverse Rendering with Near- and Far-field Li…☆14Aug 6, 2024Updated last year
- ☆23Oct 15, 2025Updated 7 months ago
- Implementation for for "L-CoDer: Language-based Colorization with Color-object Decoupling Transformer"☆13Jan 20, 2024Updated 2 years ago
- ☆19Sep 4, 2024Updated last year
- Replicate Cog'ified MMAudio☆18Apr 2, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official code for AAAI 2022 paper "L-CoDe: Language-based Colorization Using Color-object Decoupled Conditions"☆19Jan 8, 2024Updated 2 years ago
- Music production for silent film clips.☆32Apr 30, 2025Updated last year
- A Powerful LoRA key converter for ComfyUI☆28Nov 17, 2025Updated 6 months ago
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning☆14Apr 2, 2025Updated last year
- [AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation☆65Aug 20, 2025Updated 9 months ago
- SketchColour receives colored first frame and entire scene in sketch format, then colors each frame based on the reference. Evaluated on …☆32Jul 9, 2025Updated 10 months ago
- ☆13Oct 14, 2024Updated last year
- Official PyTorch implementation of the CVPR 2024 Highlight Paper "Real-time 3D-aware Portrait Video Relighting"☆64Oct 23, 2024Updated last year
- ☆11Sep 28, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.☆275Feb 3, 2026Updated 3 months ago
- 🎬 AI Movie Script & Storyboard Generator – An AI-powered tool that creates movie scripts with GPT-4 and visual storyboards using DALL-E …☆14Oct 9, 2024Updated last year
- VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization☆21Jan 17, 2025Updated last year
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Aug 6, 2025Updated 9 months ago
- ☆73Mar 10, 2026Updated 2 months ago
- Official code for ECCV 2022 paper ``CT2: Colorization Transformer via Color Tokens"☆87Jun 21, 2023Updated 2 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 6 months ago
- This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs.☆25Jan 7, 2025Updated last year
- LoRA Pilot is an ultimate docker image for all Stable Diffusion LoRA trainers. Includes kohya_ss, diffusion pipes and TensorBoard for tra…☆63May 10, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official code for ICCV 2021 paper "Towards Vivid and Diverse Image Colorization with Generative Color Prior".☆49Aug 1, 2023Updated 2 years ago
- ☆18Jan 17, 2025Updated last year
- ☆36Dec 26, 2023Updated 2 years ago
- [arXiv 2025] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models☆37Aug 26, 2025Updated 9 months ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆160Apr 6, 2026Updated last month
- Make Kanye sing any song ya want 🎤🔥☆25Apr 25, 2023Updated 3 years ago
- Batch video captioning using Qwen3-VL-8B vision-language model☆80Apr 19, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Dec 14, 2024Updated last year
- Upload a video and provide a prompt to generate a narration.☆12Mar 5, 2025Updated last year
- Production-ready, Light, and Flexible Webhook Infrastructure | Effortlessly Build Performant Webhook Integrations☆12Sep 8, 2024Updated last year
- Controlnet module for Wan2.1☆31Aug 4, 2025Updated 9 months ago
- ☆30Aug 21, 2024Updated last year
- Useing the ComfyUI workflow through a chat interface☆20Apr 14, 2025Updated last year
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆34Dec 15, 2024Updated last year