TalkingMachines
☆178Aug 2, 2025Updated 7 months ago
Alternatives and similar repositories for TalkingMachines
Users that are interested in TalkingMachines are comparing it to the libraries listed below
Sorting:
- ☆30Jun 30, 2025Updated 8 months ago
- LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models☆148Jun 11, 2025Updated 8 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆335Dec 7, 2025Updated 2 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated 11 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆45Jan 25, 2026Updated last month
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆343Mar 11, 2025Updated 11 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 4 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆233Aug 22, 2025Updated 6 months ago
- Preprocessing Scipts for Talking Face Generation☆94Jan 21, 2025Updated last year
- Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions☆63May 13, 2025Updated 9 months ago
- ☆30Mar 24, 2025Updated 11 months ago
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生 成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 7 months ago
- Blending Custom Photos with Video Diffusion Transformers☆48Jan 21, 2025Updated last year
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- A plugin application that utilizes ComfyUI to generate 360-degree panoramic images. It primarily works by converting between flat images …☆16Jun 23, 2025Updated 8 months ago
- Official repository for "GPHM: Gaussian Parametric Head Model for Monocular Head Avatar Reconstruction"☆23Oct 28, 2024Updated last year
- Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆238Nov 12, 2025Updated 3 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆456Mar 5, 2025Updated last year
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- ☆13Jul 10, 2024Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 4 months ago
- LIA-X: Interpretable Latent Portrait Animator☆99Sep 17, 2025Updated 5 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆672Feb 13, 2026Updated 3 weeks ago
- [ECCV2024 offical]KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding☆34Jul 12, 2024Updated last year
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆377Jan 23, 2026Updated last month
- Scaling Zero-Shot Reference-to-Video Generation☆63Dec 11, 2025Updated 2 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Apr 8, 2025Updated 10 months ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 10 months ago
- SDK and pollen-vision tutorials for users of Reachy2☆17Jul 31, 2025Updated 7 months ago
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆714Nov 12, 2025Updated 3 months ago
- [NeurIPS 2025 Spotlight] Official repository for “Puppeteer: Rig and Animate Your 3D Models”☆337Sep 19, 2025Updated 5 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Jan 24, 2025Updated last year
- Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)☆3,180Sep 12, 2025Updated 5 months ago
- ☆259Feb 27, 2026Updated last week
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆291Aug 7, 2025Updated 6 months ago
- ☆27May 30, 2025Updated 9 months ago
- ☆21Jun 3, 2023Updated 2 years ago
- ☆16Nov 28, 2023Updated 2 years ago