OpenMOSS/MOVA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenMOSS/MOVA)

OpenMOSS / MOVA

MOVA: Towards Scalable and Synchronized Video–Audio Generation

☆1,083

Alternatives and similar repositories for MOVA

Users that are interested in MOVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GAIR-NLP / daVinci-MagiHuman
View on GitHub
☆2,101Apr 11, 2026Updated 3 months ago
OpenMOSS / MOSS-VL
View on GitHub
MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
☆398Updated this week
Guoxu1233 / DreamID-Omni
View on GitHub
[ICML 2026] DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
☆274May 22, 2026Updated 2 months ago
KlingAIResearch / UniVideo
View on GitHub
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
☆541Jul 3, 2026Updated 3 weeks ago
showlab / Kiwi-Edit
View on GitHub
A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.
☆306May 13, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
OmniForcing / OmniForcing
View on GitHub
[ECCV 2026 Oral] Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForc…
☆169Updated this week
PKU-YuanGroup / Helios
View on GitHub
Helios: Real Real-Time Long Video Generation Model
☆1,999Jun 10, 2026Updated last month
character-ai / Ovi
View on GitHub
☆1,742Nov 15, 2025Updated 8 months ago
MCG-NJU / Sora2-mini
View on GitHub
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
☆57Dec 16, 2025Updated 7 months ago
OpenMOSS / MOSS-Audio-Tokenizer
View on GitHub
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…
☆248Jun 16, 2026Updated last month
Lightricks / LTX-2
View on GitHub
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
☆8,385Jul 8, 2026Updated 2 weeks ago
HM-RunningHub / ComfyUI_RH_MOVA
View on GitHub
This is a ComfyUI plugin for https://github.com/OpenMOSS/MOVA
☆22Jan 30, 2026Updated 5 months ago
Francis-Rings / FlashPortrait
View on GitHub
[CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…
☆479Feb 21, 2026Updated 5 months ago
EzioBy / Ditto
View on GitHub
[CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
☆617Jun 1, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SkyworkAI / SkyReels-V3
View on GitHub
SkyReels V3: Multimodal Video Generation Model
☆520Jan 30, 2026Updated 5 months ago
thu-ml / Causal-Forcing
View on GitHub
[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactiv…
☆879Updated this week
OpenMOSS / MOSS-Speech
View on GitHub
MOSS-Speech is a true speech-to-speech large language model without text guidance.
☆138Feb 13, 2026Updated 5 months ago
aigc-apps / VideoX-Fun
View on GitHub
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
☆2,179Updated this week
ID-LoRA / ID-LoRA
View on GitHub
[ECCV 2026] Generate high resolution videos with a custom voice and appearance, based on LTX-2/LTX-2.3 + Identity In-Context LoRA
☆347Jun 24, 2026Updated last month
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,879Updated this week
KlingAIResearch / ShotStream
View on GitHub
[ECCV 2026] ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
☆172Jun 23, 2026Updated last month
NVlabs / rcm
View on GitHub
rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale
☆772Jun 25, 2026Updated last month
Phantom-video / HuMo
View on GitHub
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
☆1,274Jan 25, 2026Updated 6 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ernie-research / NAVA
View on GitHub
Official Code of NAVA: Native Audio-Visual Alignment for Generation.
☆214Jun 30, 2026Updated 3 weeks ago
OpenMOSS / MOSS-TTSD
View on GitHub
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…
☆1,362Mar 23, 2026Updated 4 months ago
thu-ml / TurboDiffusion
View on GitHub
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
☆3,582Jul 16, 2026Updated last week
bytedance / DreamID-V
View on GitHub
[ECCV 2026 Oral] DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
☆667May 22, 2026Updated 2 months ago
tongjingqi / Thinking-with-Video
View on GitHub
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…
☆315Jun 21, 2026Updated last month
Alibaba-Quark / LiveAvatar
View on GitHub
[ECCV 2026 Oral] Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
☆2,272Updated this week
ssj9596 / One-to-All-Animation
View on GitHub
[CVPR 2026 Poster] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
☆490Apr 19, 2026Updated 3 months ago
NVlabs / LongLive
View on GitHub
Long Video Gen Infrastructure
☆2,491Jul 15, 2026Updated last week
FoundationVision / InfinityStar
View on GitHub
[NeurIPS 2025 Oral]Infinity⭐️: Uniﬁed Spacetime AutoRegressive Modeling for Visual Generation
☆773Apr 16, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Correr-Zhou / OmniShow
View on GitHub
[ICML 2026] ByteDance's All-in-One Video Generation Model for Human-Object Interaction Video Generation
☆459May 19, 2026Updated 2 months ago
bytedance / Video-As-Prompt
View on GitHub
[ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"
☆441Feb 8, 2026Updated 5 months ago
Kevin-thu / StoryMem
View on GitHub
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
☆760Updated this week
zai-org / SCAIL
View on GitHub
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)
☆1,024May 6, 2026Updated 2 months ago
PangzeCheung / OmniTransfer
View on GitHub
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer
☆233Apr 15, 2026Updated 3 months ago
Dorniwang / UniVerse-1-code
View on GitHub
The official UniVerse-1 code.
☆129Oct 13, 2025Updated 9 months ago
OpenMOSS / MOSS-Audio
View on GitHub
MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoni…
☆617Jun 2, 2026Updated last month