Soul-AILab / SoulX-FlashTalkLinks
SoulX-FlashTalk is the first 14B model to achieve a sub-second start-up latency (0.87s) while sustaining a real-time throughput of 32 FPS
☆72Updated this week
Alternatives and similar repositories for SoulX-FlashTalk
Users that are interested in SoulX-FlashTalk are comparing it to the libraries listed below
Sorting:
- The homepage of LongCat-Video-Avatar☆89Updated 3 weeks ago
- LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models☆145Updated 6 months ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆61Updated 6 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆53Updated last year
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆288Updated 5 months ago
- Preprocessing Scipts for Talking Face Generation☆92Updated 11 months ago
- 实现基于4k视频的高分辨率人物换衣、虚拟试穿、物品替换☆56Updated 3 years ago
- A 2D customized lip-sync model for high-fidelity real-time driving.☆118Updated 6 months ago
- PersonaTalk Hack☆15Updated last year
- Generate ARKit expression from audio in realtime☆175Updated 2 months ago
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis☆134Updated last month
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆65Updated last year
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement☆75Updated last year
- [AAAI 2025] The official repository of UniMuMo☆126Updated 3 months ago
- DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework☆141Updated 5 months ago
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆35Updated 11 months ago
- ☆26Updated 2 years ago
- OpenVideo specializes in the domain of text-to-video generation, with the goal of providing high-quality and diverse video datasets to AI…☆113Updated 7 months ago
- Official Repo for MoCha Towards Movie-Grade Talking Character Synthesis☆60Updated 2 weeks ago
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Updated 2 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆35Updated 5 months ago
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation☆180Updated last year
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆189Updated last year
- ☆17Updated last year
- project page for ChatAnyone☆115Updated 9 months ago
- ☆62Updated 6 months ago
- ☆55Updated 6 months ago
- mash up of Wan2.1 + Meta Sapiens + Seaweed Diffusion APT for One-Step Video Generation if you have compute - call me☆73Updated 9 months ago
- Daily tracking of awesome avatar papers, including 2d talking head, 3d head avatar, body avatar.☆77Updated 3 months ago
- ☆14Updated 10 months ago