MCG-NJU/Sora2-mini

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MCG-NJU/Sora2-mini)

MCG-NJU / Sora2-mini

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

☆57

Alternatives and similar repositories for Sora2-mini

Users that are interested in Sora2-mini are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tencent-Hunyuan / HY-Video-PRFL
View on GitHub
☆92Jan 13, 2026Updated 6 months ago
Dorniwang / UniVerse-1-code
View on GitHub
The official UniVerse-1 code.
☆129Oct 13, 2025Updated 9 months ago
lian700 / SoliReward
View on GitHub
Official Code for "SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models" [CVPR2…
☆21Jul 13, 2026Updated 2 weeks ago
OmniForcing / OmniForcing
View on GitHub
[ECCV 2026 Oral] Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForc…
☆171Updated this week
sjtuplayer / Harmony
View on GitHub
Audio-video joint generation
☆58Nov 27, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
suimuc / MTV_Framework
View on GitHub
☆23Oct 15, 2025Updated 9 months ago
snap-research / TalkVerse
View on GitHub
☆31Jan 30, 2026Updated 5 months ago
OpenVE-Team / OpenVE-3M
View on GitHub
OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing
☆51Apr 15, 2026Updated 3 months ago
OpenMOSS / MOVA
View on GitHub
MOVA: Towards Scalable and Synchronized Video–Audio Generation
☆1,087Jun 18, 2026Updated last month
wuxiaofei01 / PFVG
View on GitHub
☆20Dec 24, 2025Updated 7 months ago
zhangzjn / Soul
View on GitHub
[CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
☆64Dec 16, 2025Updated 7 months ago
leeruibin / hybrid-forcing
View on GitHub
☆32Apr 29, 2026Updated 3 months ago
Guoxu1233 / DreamID-Omni
View on GitHub
[ICML 2026] DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
☆275May 22, 2026Updated 2 months ago
JavisVerse / JavisDiT
View on GitHub
[ICLR 2026] Official implementation of JavisDiT and JavisDiT++ series.
☆376Mar 29, 2026Updated 4 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
character-ai / Ovi
View on GitHub
☆1,743Nov 15, 2025Updated 8 months ago
HKUST-C4G / AnyTalker
View on GitHub
AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinement
☆323Apr 15, 2026Updated 3 months ago
MCG-NJU / VideoEval
View on GitHub
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
☆15Jul 31, 2025Updated 11 months ago
MCG-NJU / TimeLens2
View on GitHub
TimeLens2: Generalist Video Temporal Grounding with Multimodal LLMs
☆57Updated this week
Lexie-YU / ViFeEdit
View on GitHub
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
☆68Mar 31, 2026Updated 3 months ago
ziyannchen / VFRxBenchmark
View on GitHub
[NTIRE2024] official code for "Towards Real-world Video Face Restoration: A New Benchmark"
☆31Jul 29, 2024Updated 2 years ago
chengtao-lv / LightForcing
View on GitHub
[ICML 2026] Official repository for the paper "Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention"
☆42May 24, 2026Updated 2 months ago
Tencent-Hunyuan / HunyuanCustom
View on GitHub
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
☆1,227Oct 15, 2025Updated 9 months ago
Visual-AI / JoVA
View on GitHub
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆33Dec 22, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jianzongwu / Does-Hearing-Help-Seeing
View on GitHub
☆19Dec 3, 2025Updated 7 months ago
KlingAIResearch / MultiShotMaster
View on GitHub
CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"
☆171Feb 22, 2026Updated 5 months ago
hyj542682306 / Semantic-Frame-Interpolation
View on GitHub
☆21Jul 8, 2025Updated last year
byhuang123 / PoCo
View on GitHub
[CVPR2026] Official implementation of our paper “Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot…
☆19Apr 8, 2026Updated 3 months ago
bytedance-fanqie-ai / MoGA
View on GitHub
Mixture-of-Groups Attention for End-to-End Long Video Generation
☆99Oct 22, 2025Updated 9 months ago
Phantom-video / LibraGen
View on GitHub
☆17Mar 19, 2026Updated 4 months ago
SkyworkAI / SkyReels-V3
View on GitHub
SkyReels V3: Multimodal Video Generation Model
☆523Jan 30, 2026Updated 5 months ago
NJU-LINK / T2AV-Compass
View on GitHub
The Source Code for T2AV-Compass @ ICML 2026
☆20Jun 21, 2026Updated last month
Sirui-Xu / DuMMF
View on GitHub
[ICLR 2023 spotlight] Official PyTorch implementation of the paper "Stochastic Multi-Person 3D Motion Forecasting"
☆54Sep 1, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
BigAandSmallq / SAD
View on GitHub
Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework f…
☆31Nov 4, 2025Updated 8 months ago
JaydenLyh / Reward-Forcing
View on GitHub
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆352Dec 15, 2025Updated 7 months ago
manmay-nakhashi / TTSizer
View on GitHub
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆18May 20, 2025Updated last year
m-hamza-mughal / convofusion
View on GitHub
Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
☆37Apr 29, 2025Updated last year
MCG-NJU / SPLAM
View on GitHub
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
☆24Nov 1, 2024Updated last year
KlingAIResearch / VideoAlign
View on GitHub
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆489Sep 24, 2025Updated 10 months ago
Fu-Fu-Fu-Fu / VideoKR
View on GitHub
[ICML 26 Spotlight] Code for paper "VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding"
☆19Jun 5, 2026Updated last month