☆1,688Nov 15, 2025Updated 5 months ago
Alternatives and similar repositories for Ovi
Users that are interested in Ovi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ComfyUI custom nodes for Ovi joint video+audio generation☆47Oct 6, 2025Updated 6 months ago
- ☆76Dec 8, 2025Updated 4 months ago
- [ICLR 2026] LongLive: Real-time Interactive Long Video Generation☆1,147Feb 26, 2026Updated last month
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,214Oct 15, 2025Updated 6 months ago
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer☆608Mar 13, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆2,239Apr 2, 2026Updated 2 weeks ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,722Oct 17, 2025Updated 5 months ago
- ☆23Oct 15, 2025Updated 6 months ago
- The official code of Yume☆644Jan 14, 2026Updated 3 months ago
- [CVPR 2026 Highlight] High-Quality Text-to-Video Generation with Alpha Channel☆357Apr 9, 2026Updated last week
- UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions☆48Dec 16, 2025Updated 4 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆503Aug 20, 2025Updated 7 months ago
- Scalable and memory-optimized training of diffusion models☆1,351Apr 8, 2026Updated last week
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆345Oct 30, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Pusa: Thousands Timesteps Video Diffusion Model☆677Feb 13, 2026Updated 2 months ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,297Aug 7, 2025Updated 8 months ago
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images.☆2,014Updated this week
- Official repository for LTX-Video☆9,872Jan 5, 2026Updated 3 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,498Sep 11, 2025Updated 7 months ago
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer☆360Mar 26, 2026Updated 3 weeks ago
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆716Jan 22, 2026Updated 2 months ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆925Aug 27, 2025Updated 7 months ago
- [AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation☆65Aug 20, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆285Jun 10, 2025Updated 10 months ago
- [ICLR 2026] Official Repo For "BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration"☆348Jan 28, 2026Updated 2 months ago
- [CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆592Oct 29, 2025Updated 5 months ago
- A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.☆5,012Apr 9, 2026Updated last week
- ☆2,493Jul 16, 2025Updated 9 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,202Jan 25, 2026Updated 2 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆514Dec 11, 2024Updated last year
- A unified inference and post-training framework for accelerated video generation.☆3,365Updated this week
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,982Feb 3, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,353Sep 12, 2025Updated 7 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆423Aug 26, 2025Updated 7 months ago
- [CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆754Feb 21, 2026Updated last month
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,624Jan 26, 2026Updated 2 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆68Dec 11, 2025Updated 4 months ago
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆229Feb 21, 2026Updated last month
- MAGI-1: Autoregressive Video Generation at Scale☆3,672Jun 17, 2025Updated 9 months ago