baaivision/MTVCraft

MTVCraft: An Open Veo3-style Audio-Video Generation Demo

☆98

Alternatives and similar repositories for MTVCraft

Users who are interested in MTVCraft are comparing it to the libraries listed below.

costrice / vminer
Official implementation and project page of the CVPR'24 paper "VMINer: Versatile Multi-view Inverse Rendering with Near- and Far-field Li…
☆14 · Aug 6, 2024 · Updated last year
suimuc / MTV_Framework
☆23 · Oct 15, 2025 · Updated 9 months ago
suimuc / InstructAV2AV
☆41 · Jun 5, 2026 · Updated last month
changzheng123 / L-CoIns
Implementation for "L-CoIns: Language-based Colorization with Instance Awareness"
☆11 · Dec 7, 2023 · Updated 2 years ago
changzheng123 / L-CoDer
Implementation for "L-CoDer: Language-based Colorization with Color-object Decoupling Transformer"
☆13 · Jan 20, 2024 · Updated 2 years ago
klingfoley / Kling-Foley
Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
☆62 · Jun 26, 2025 · Updated last year
VariantConst / PanoWan
Official repository for "PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms"
☆52 · Dec 18, 2025 · Updated 7 months ago
camenduru / FluxMusic-jupyter
☆18 · Sep 4, 2024 · Updated last year
zsxkib / cog-create-video-dataset
Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning
☆15 · Apr 2, 2025 · Updated last year
costrice / split
Official implementation for paper "SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics"
☆19 · Jul 8, 2024 · Updated 2 years ago
tanABCC / VABench
☆16 · Jul 8, 2026 · Updated 3 weeks ago
microsoft / AVGen-Bench
[ICML26] AVGen-Bench is a task-driven benchmark for multi-granular evaluation of Text-to-Audio-Video (T2AV) generation.
☆24 · Jul 2, 2026 · Updated 3 weeks ago
DINGYANB / MTVCrafter
Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.
☆277 · Feb 3, 2026 · Updated 5 months ago
changzheng123 / L-CoDe
Official code for AAAI 2022 paper "L-CoDe: Language-based Colorization Using Color-object Decoupled Conditions"
☆19 · Jan 8, 2024 · Updated 2 years ago
RamonGuthrie / ComfyUI-RBG-LoraConverter
A powerful LoRA key converter for ComfyUI
☆29 · Nov 17, 2025 · Updated 8 months ago
InternLM / StarBench
[ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"
☆42 · Apr 19, 2026 · Updated 3 months ago
changzheng123 / L-CAD
Implementation for "L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors"
☆45 · Jun 16, 2025 · Updated last year
Yaofang-Liu / Pusa-VidGen
Pusa: Thousands Timesteps Video Diffusion Model
☆686 · Feb 13, 2026 · Updated 5 months ago
HM-RunningHub / ComfyUI_RH_MOVA
This is a ComfyUI plugin for https://github.com/OpenMOSS/MOVA
☆22 · Jan 30, 2026 · Updated 5 months ago
camenduru / autocaption-colab
☆19 · Jan 15, 2024 · Updated 2 years ago
GhostCai / PortraitRelighting
Official PyTorch implementation of the CVPR 2024 Highlight Paper "Real-time 3D-aware Portrait Video Relighting"
☆67 · Oct 23, 2024 · Updated last year
TheDenk / wan2.1-dilated-controlnet
ControlNet module for Wan2.1
☆32 · Aug 4, 2025 · Updated 11 months ago
Dorniwang / UniVerse-1-code
The official UniVerse-1 code.
☆129 · Oct 13, 2025 · Updated 9 months ago
AIGeeksGroup / UniVid
UniVid: The Open-Source Unified Video Model
☆32 · Oct 13, 2025 · Updated 9 months ago
camenduru / ExVideo-jupyter
☆14 · Jun 23, 2024 · Updated 2 years ago
character-ai / Ovi
☆1,743 · Nov 15, 2025 · Updated 8 months ago
ozekimasaki / obs-motion-pngtuber-player
☆18 · Jul 18, 2026 · Updated last week
camenduru / TANGO-jupyter
☆13 · Oct 14, 2024 · Updated last year
ai-forever / KandiSuperRes
☆30 · Aug 21, 2024 · Updated last year
alibaba-damo-academy / Lumos
[ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.
☆161 · Apr 6, 2026 · Updated 3 months ago
Apple-jun / FilmComposer
Music production for silent film clips.
☆34 · Apr 30, 2025 · Updated last year
wx9Songs / MOSS-Music-Data-Pipeline
☆44 · Apr 26, 2026 · Updated 3 months ago
Fantasy-AMAP / fantasy-talking2
[AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation
☆65 · Aug 20, 2025 · Updated 11 months ago
alpoktem / movie2parallelDB
Automatic parallel speech database extractor from dubbed movies
☆27 · Aug 20, 2024 · Updated last year
KlingAIResearch / SynCamMaster
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
☆692 · May 23, 2025 · Updated last year
zsxkib / cog-mmaudio
Replicate Cog'ified MMAudio
☆18 · Apr 2, 2025 · Updated last year
camenduru / echomimic-jupyter
☆14 · Nov 22, 2024 · Updated last year
yanghaha0908 / WavCube
Official code for "WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling"
☆62 · Jun 27, 2026 · Updated last month
zghhui / OmniNFT
Code for "OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation"
☆150 · Jun 18, 2026 · Updated last month