☆320Jan 2, 2026Updated 6 months ago
Alternatives and similar repositories for LiveTalk
Users that are interested in LiveTalk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2026] Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"☆2,209Jun 18, 2026Updated 2 weeks ago
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆21Jan 31, 2024Updated 2 years ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆63Apr 7, 2026Updated 2 months ago
- ☆37Feb 7, 2026Updated 4 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆334Dec 15, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆27Oct 19, 2024Updated last year
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆355Mar 11, 2025Updated last year
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆302Aug 7, 2025Updated 10 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆973Feb 27, 2026Updated 4 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆23Apr 10, 2026Updated 2 months ago
- E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker☆58Apr 16, 2026Updated 2 months ago
- [CVPR 2026 Poster] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆481Apr 19, 2026Updated 2 months ago
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆12Jul 2, 2024Updated 2 years ago
- Hallo-Live: Real-Time Streaming Joint Audio-Video Avatar Generation☆325Jun 24, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆106Oct 3, 2025Updated 9 months ago
- Prompt Sniffer by Mohsyn: View / Extract / Copy / Remove AI metadata from images ( right click support )☆25Jun 4, 2025Updated last year
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated last year
- ☆15Jan 11, 2024Updated 2 years ago
- ☆18Jun 14, 2025Updated last year
- [CVPR 2026] Official Pytorch implementation of Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation☆320Jun 3, 2026Updated last month
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TalkingMachines☆178Aug 2, 2025Updated 11 months ago
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis☆185Jun 8, 2026Updated 3 weeks ago
- ☆183Dec 23, 2025Updated 6 months ago
- Agent驱动的实时广播电台 实验性项目☆38Feb 8, 2026Updated 4 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆75Feb 26, 2026Updated 4 months ago
- ☆20Jun 10, 2025Updated last year
- Accepted by ICML2026☆89Updated this week
- ComfyUI custom node implementation of VideoMaMa for video matting with mask conditioning.☆58Feb 9, 2026Updated 4 months ago
- ☆120Nov 6, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- tauri v2, shadcn, tailwindcss 4.x boilerplate☆36Jan 4, 2026Updated 6 months ago
- ☆23Dec 12, 2025Updated 6 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 8 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 9 months ago
- A high quality and fast TTS repository☆516Dec 22, 2025Updated 6 months ago
- Accurate 3D Facial Geometry Prediction by Multi-Task, Multi-Modal, and Multi-Representation Landmark Refinement Network☆13Oct 20, 2021Updated 4 years ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆30Nov 18, 2025Updated 7 months ago