☆313Jan 2, 2026Updated 4 months ago
Alternatives and similar repositories for LiveTalk
Users that are interested in LiveTalk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"☆2,090Apr 8, 2026Updated last month
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆21Jan 31, 2024Updated 2 years ago
- ☆31Feb 7, 2026Updated 3 months ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆60Apr 7, 2026Updated last month
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆327Dec 15, 2025Updated 5 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆353Mar 11, 2025Updated last year
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆301Aug 7, 2025Updated 9 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆946Feb 27, 2026Updated 2 months ago
- SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS …☆1,285Updated this week
- ☆122Apr 25, 2026Updated last month
- E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker☆55Apr 16, 2026Updated last month
- [CVPR 2026 Poster] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆464Apr 19, 2026Updated last month
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆789Nov 12, 2025Updated 6 months ago
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆12Jul 2, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆105Oct 3, 2025Updated 7 months ago
- Prompt Sniffer by Mohsyn: View / Extract / Copy / Remove AI metadata from images ( right click support )☆25Jun 4, 2025Updated 11 months ago
- Multi-Agent Collaboration Design Patterns Built on LangGraph with 10+ battle-tested patterns, each with complete code, architectu…☆48Apr 9, 2026Updated last month
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 10 months ago
- ☆15Jan 11, 2024Updated 2 years ago
- ☆18Jun 14, 2025Updated 11 months ago
- [CVPR 2026] Official Pytorch implementation of Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation☆299Feb 22, 2026Updated 3 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆61Dec 16, 2025Updated 5 months ago
- TalkingMachines☆179Aug 2, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis☆172Jan 11, 2026Updated 4 months ago
- ☆178Dec 23, 2025Updated 5 months ago
- Local Texture Pattern Estimation for Image Detail Super-Resolution☆24Apr 11, 2025Updated last year
- Agent驱动的实时广播电台 实验性项目☆37Feb 8, 2026Updated 3 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆73Feb 26, 2026Updated 2 months ago
- ☆20Jun 10, 2025Updated 11 months ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆217May 6, 2026Updated 2 weeks ago
- ComfyUI custom node implementation of VideoMaMa for video matting with mask conditioning.☆56Feb 9, 2026Updated 3 months ago
- ☆116Nov 6, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- tauri v2, shadcn, tailwindcss 4.x boilerplate☆34Jan 4, 2026Updated 4 months ago
- ☆22Dec 12, 2025Updated 5 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 6 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 8 months ago
- A high quality and fast TTS repository☆512Dec 22, 2025Updated 5 months ago
- Accurate 3D Facial Geometry Prediction by Multi-Task, Multi-Modal, and Multi-Representation Landmark Refinement Network☆13Oct 20, 2021Updated 4 years ago
- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling☆135Mar 31, 2026Updated last month