Soul-AILab / SoulX-FlashTalk
SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.
☆715 · Jan 30, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for SoulX-FlashTalk
Users who are interested in SoulX-FlashTalk are comparing it to the libraries listed below.
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length" ☆1,797 · Jan 30, 2026 · Updated 2 weeks ago
- ☆1,787 · Aug 6, 2025 · Updated 6 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation ☆4,793 · Dec 18, 2025 · Updated last month
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice. ☆242 · Nov 11, 2025 · Updated 3 months ago
- [SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head ☆918 · Sep 11, 2025 · Updated 5 months ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM ☆11 · Jun 16, 2025 · Updated 7 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,620 · Jan 26, 2026 · Updated 2 weeks ago
- Streaming Text to Speech Web UI ☆22 · May 6, 2024 · Updated last year
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers ☆582 · Jun 5, 2025 · Updated 8 months ago
- ☆28 · Jan 30, 2026 · Updated 2 weeks ago
- This is the PyTorch implementation of the SIGGRAPH 2023 paper "Efficient Video Portrait Reenactment via Grid-based Codebook" ☆39 · Aug 28, 2023 · Updated 2 years ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers ☆501 · Aug 20, 2025 · Updated 5 months ago
- MOVA: Towards Scalable and Synchronized Video–Audio Generation ☆630 · Updated this week
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars ☆393 · Apr 8, 2025 · Updated 10 months ago
- ☆82 · Jan 4, 2026 · Updated last month
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement ☆75 · Jul 29, 2024 · Updated last year
- [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation ☆755 · Feb 5, 2026 · Updated last week
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆8,641 · Sep 14, 2024 · Updated last year
- [ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation ☆3,675 · Feb 27, 2025 · Updated 11 months ago
- AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinement ☆278 · Dec 5, 2025 · Updated 2 months ago
- Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars ☆95 · Nov 4, 2024 · Updated last year
- Taming Stable Diffusion for Lip Sync! ☆5,421 · Jun 20, 2025 · Updated 7 months ago
- MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting ☆5,305 · Sep 26, 2025 · Updated 4 months ago
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation" ☆3,188 · Jan 8, 2026 · Updated last month
- Align Anything: Training All-modality Model with Feedback ☆4,632 · Nov 27, 2025 · Updated 2 months ago
- ☆29 · Jul 4, 2025 · Updated 7 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆13 · Jun 28, 2025 · Updated 7 months ago
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer ☆1,367 · Mar 13, 2025 · Updated 11 months ago
- The next generation deep reinforcement learning toolkit ☆3,459 · Jun 16, 2023 · Updated 2 years ago
- 🚀 The best real-time interactive AI avatar (digital human) with on-premise deployment and <1.5 s latency. ☆7,835 · Dec 31, 2025 · Updated last month
- Memory-Guided Diffusion for Expressive Talking Video Generation ☆1,077 · Aug 6, 2025 · Updated 6 months ago
- The official implementation of RealisDance ☆610 · Jun 20, 2025 · Updated 7 months ago
- VectorTalker: SVG Talking Face Generation with Progressive Vectorisation ☆15 · Dec 25, 2023 · Updated 2 years ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation ☆77 · Aug 20, 2025 · Updated 5 months ago
- ☆251 · Jan 2, 2026 · Updated last month
- ☆148 · Dec 23, 2025 · Updated last month
- Blending Custom Photos with Video Diffusion Transformers ☆48 · Jan 21, 2025 · Updated last year
- ☆63 · Dec 1, 2025 · Updated 2 months ago
- ICLR 2025 paper X-NeMo & Project X-Portrait2 ☆114 · Aug 7, 2025 · Updated 6 months ago