☆288Jan 2, 2026Updated 4 months ago
Alternatives and similar repositories for LiveTalk
Users that are interested in LiveTalk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Feb 14, 2025Updated last year
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"☆2,063Apr 8, 2026Updated 3 weeks ago
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆21Jan 31, 2024Updated 2 years ago
- ☆30Feb 7, 2026Updated 2 months ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆58Apr 7, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆327Dec 15, 2025Updated 4 months ago
- ☆27Oct 19, 2024Updated last year
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆350Mar 11, 2025Updated last year
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆300Aug 7, 2025Updated 8 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆935Feb 27, 2026Updated 2 months ago
- SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS …☆1,239Apr 2, 2026Updated last month
- ☆98Apr 25, 2026Updated last week
- [CVPR 2026 Poster] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆458Apr 19, 2026Updated 2 weeks ago
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆772Nov 12, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆12Jul 2, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆22Feb 10, 2025Updated last year
- Prompt Sniffer by Mohsyn: View / Extract / Copy / Remove AI metadata from images ( right click support )☆25Jun 4, 2025Updated 11 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆104Oct 3, 2025Updated 7 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 10 months ago
- ☆15Jan 11, 2024Updated 2 years ago
- ☆18Jun 14, 2025Updated 10 months ago
- [CVPR 2026] Official Pytorch implementation of Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation☆285Feb 22, 2026Updated 2 months ago
- TalkingMachines☆179Aug 2, 2025Updated 9 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis☆169Jan 11, 2026Updated 3 months ago
- ☆177Dec 23, 2025Updated 4 months ago
- 多Agent驱动的实时广播电台 实验性项目☆35Feb 8, 2026Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆72Feb 26, 2026Updated 2 months ago
- ☆20Jun 10, 2025Updated 10 months ago
- ComfyUI custom node implementation of VideoMaMa for video matting with mask conditioning.☆53Feb 9, 2026Updated 2 months ago
- ☆86Mar 16, 2026Updated last month
- ☆115Nov 6, 2025Updated 5 months ago
- tauri v2, shadcn, tailwindcss 4.x boilerplate☆34Jan 4, 2026Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Use MCP tools with Gemini Live API☆25Oct 6, 2025Updated 6 months ago
- ☆22Dec 12, 2025Updated 4 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 6 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 7 months ago
- Accurate 3D Facial Geometry Prediction by Multi-Task, Multi-Modal, and Multi-Representation Landmark Refinement Network☆13Oct 20, 2021Updated 4 years ago
- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling☆130Mar 31, 2026Updated last month
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆28Nov 18, 2025Updated 5 months ago