sjtuplayer/Harmony

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sjtuplayer/Harmony)

sjtuplayer / Harmony

Audio-video joint generation

☆58

Alternatives and similar repositories for Harmony

Users that are interested in Harmony are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DMCV-SJTU / Make-it-3D-Jittor
View on GitHub
☆39Jul 27, 2024Updated 2 years ago
sjtuplayer / UltraGen
View on GitHub
[AAAI 2026] UltraGen
☆77Feb 1, 2026Updated 5 months ago
Ryan-w2024 / PoseAnything
View on GitHub
☆39Jan 21, 2026Updated 6 months ago
j0seo / lookahead-anchoring
View on GitHub
☆15Oct 27, 2025Updated 9 months ago
ant-research / TensorialGaussianAvatar
View on GitHub
CVPR2025-3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations
☆38Sep 3, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhangzjn / T3-Video
View on GitHub
[ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation
☆41Dec 16, 2025Updated 7 months ago
zhangzjn / Soul
View on GitHub
[CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
☆64Dec 16, 2025Updated 7 months ago
sjtuplayer / IAR
View on GitHub
[CVPR25] IAR
☆18Jun 13, 2025Updated last year
hyj542682306 / Semantic-Frame-Interpolation
View on GitHub
☆21Jul 8, 2025Updated last year
MCG-NJU / Sora2-mini
View on GitHub
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
☆57Dec 16, 2025Updated 7 months ago
jinkun-hao / EgoSim
View on GitHub
[ECCV 2026] EgoSim: Egocentric World Simulator for Embodiment Interaction Generation
☆62Jun 26, 2026Updated last month
xzc-zju / UltraVideo
View on GitHub
[[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions
☆93Jul 14, 2025Updated last year
Visual-AI / JoVA
View on GitHub
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆33Dec 22, 2025Updated 7 months ago
Dorniwang / UniVerse-1-code
View on GitHub
The official UniVerse-1 code.
☆129Oct 13, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HY-SpongeBob / HY-SpongeBob
View on GitHub
☆26May 26, 2026Updated 2 months ago
sjtuplayer / Awesome-Video-Foundations
View on GitHub
Evolution of Video Generative Foundations
☆44Apr 7, 2026Updated 3 months ago
KlingAIResearch / MultiShotMaster
View on GitHub
CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"
☆171Feb 22, 2026Updated 5 months ago
UVA-Computer-Vision-Lab / FrameINO
View on GitHub
[NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
☆33May 1, 2026Updated 2 months ago
knightyxp / VideoCoF
View on GitHub
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
☆205Jun 17, 2026Updated last month
JavisVerse / JavisGPT
View on GitHub
[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"
☆75Feb 26, 2026Updated 5 months ago
Shi-qingyu / DreamRelation
View on GitHub
[CVPR 2025] DreamRelation: Bridging Customization and Relation Generation
☆19Dec 17, 2025Updated 7 months ago
xyz123xyz456 / hallo4
View on GitHub
☆61Dec 1, 2025Updated 7 months ago
WeChatCV / NovaEdit
View on GitHub
[CVPR26] Nova: Video Editing via single/multiple frame references
☆50Mar 4, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MC-E / InstructX
View on GitHub
☆86Oct 10, 2025Updated 9 months ago
sjtuplayer / MotionMaster
View on GitHub
[ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation
☆103Oct 15, 2024Updated last year
comfyuiattic-989 / ComfyUI-Video-Frame-Extractor
View on GitHub
A ComfyUI custom node that brings a DAW-style interactive video timeline directly into the node graph. Upload any video, scrub through it…
☆18Apr 21, 2026Updated 3 months ago
CUC-MIPG / IC-Effect
View on GitHub
Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"
☆43Jan 29, 2026Updated 6 months ago
poppuppy / SAR
View on GitHub
☆34Dec 29, 2025Updated 7 months ago
sjtuplayer / few-shot-diffusion
View on GitHub
[ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption
☆67Dec 7, 2023Updated 2 years ago
Songlin1998 / ShotVerse
View on GitHub
☆103Mar 13, 2026Updated 4 months ago
TaatiTeam / Token-Perturbation-Guidance
View on GitHub
Official implementation of "Token Perturbation Guidance for Diffusion Models" [NeurIPS 2025]
☆17May 19, 2026Updated 2 months ago
yyChen233 / ContextFlow
View on GitHub
The official Pytorch code for paper "ContextFlow: Training-Free Video Object Editing via Adaptive Context Enrichment"
☆25Apr 8, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zai-org / Kaleido
View on GitHub
Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …
☆144Mar 2, 2026Updated 4 months ago
fudan-generative-vision / hallo4
View on GitHub
[SIGGRAPH Asia 2025] Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
☆38Nov 30, 2025Updated 7 months ago
NJU-LINK / T2AV-Compass
View on GitHub
The Source Code for T2AV-Compass @ ICML 2026
☆20Jun 21, 2026Updated last month
Yubo-Shankui / Bind-Your-Avatar-Implementation
View on GitHub
(CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…
☆34Apr 7, 2026Updated 3 months ago
Phantom-video / LibraGen
View on GitHub
☆17Mar 19, 2026Updated 4 months ago
sjtuplayer / SaRA
View on GitHub
SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation
☆122Oct 18, 2024Updated last year
kinam0252 / TIC-FT
View on GitHub
☆52Jan 6, 2026Updated 6 months ago