☆316Jan 2, 2026Updated 5 months ago
Alternatives and similar repositories for LiveTalk
Users that are interested in LiveTalk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Feb 14, 2025Updated last year
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"☆2,140May 31, 2026Updated 2 weeks ago
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆21Jan 31, 2024Updated 2 years ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆61Apr 7, 2026Updated 2 months ago
- ☆35Feb 7, 2026Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆354Mar 11, 2025Updated last year
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆301Aug 7, 2025Updated 10 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆961Feb 27, 2026Updated 3 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆23Apr 10, 2026Updated 2 months ago
- SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS …☆1,338May 21, 2026Updated 3 weeks ago
- ☆133Apr 25, 2026Updated last month
- E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker☆56Apr 16, 2026Updated last month
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆811Nov 12, 2025Updated 7 months ago
- X-Voice☆161Jun 5, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆12Jul 2, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆106Oct 3, 2025Updated 8 months ago
- Prompt Sniffer by Mohsyn: View / Extract / Copy / Remove AI metadata from images ( right click support )☆25Jun 4, 2025Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 11 months ago
- ☆15Jan 11, 2024Updated 2 years ago
- ☆18Jun 14, 2025Updated last year
- Confucius4-TTS: a Multilingual and Cross-Lingual Zero-Shot TTS Engine☆143Jun 6, 2026Updated last week
- [CVPR 2026] Official Pytorch implementation of Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation☆309Jun 3, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 5 months ago
- TalkingMachines☆178Aug 2, 2025Updated 10 months ago
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis☆177Updated this week
- ☆182Dec 23, 2025Updated 5 months ago
- Agent驱动的实时广播电台 实验性项目☆37Feb 8, 2026Updated 4 months ago
- ☆20Jun 10, 2025Updated last year
- ☆88Mar 16, 2026Updated 2 months ago
- ☆120Nov 6, 2025Updated 7 months ago
- ☆22Dec 12, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 7 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 8 months ago
- A high quality and fast TTS repository☆512Dec 22, 2025Updated 5 months ago
- Accurate 3D Facial Geometry Prediction by Multi-Task, Multi-Modal, and Multi-Representation Landmark Refinement Network☆13Oct 20, 2021Updated 4 years ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆29Nov 18, 2025Updated 6 months ago
- Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players☆607May 28, 2026Updated 2 weeks ago
- Matlab codes for PAT image reconstruction from subsampled data based on a novel regularisation term (Hessian Schatten-norm of the filtere…☆11Aug 21, 2019Updated 6 years ago