FreeTalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 2024)
☆79 Apr 9, 2026 Updated 2 months ago
Alternatives and similar repositories for FreeTalker
Users that are interested in FreeTalker are comparing it to the libraries listed below.
- UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons (ACM MM 2023 Oral) ☆56 Jan 15, 2024 Updated 2 years ago
- The ReprGesture entry to the GENEA Challenge 2022 (ICMI 2022) ☆16 Nov 8, 2022 Updated 3 years ago
- DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ ent… ☆213 Apr 9, 2026 Updated 2 months ago
- QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation (CVPR 2023 Highlight) ☆105 Oct 18, 2023 Updated 2 years ago
- Scripts for numerical evaluations for the GENEA Gesture Generation Challenge ☆24 Nov 28, 2022 Updated 3 years ago
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (… ☆21 Apr 19, 2024 Updated 2 years ago
- ☆22 Apr 17, 2024 Updated 2 years ago
- Awesome Gesture Generation ☆248 Nov 8, 2025 Updated 7 months ago
- This repository contains data pre-processing and visualization scripts used in GENEA Challenge 2022 and 2023. Check the repository's READ… ☆28 May 29, 2025 Updated last year
- Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis ☆36 Apr 29, 2025 Updated last year
- [CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation ☆202 Apr 30, 2024 Updated 2 years ago
- ☆52 Jun 26, 2025 Updated 11 months ago
- [AAAI 2025] Official repo for paper "MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls" ☆129 Jan 18, 2025 Updated last year
- [CVPR'2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation ☆263 Mar 18, 2026 Updated 2 months ago
- Official Implementation of the Paper: Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation (ACMMM 2024) ☆78 Mar 29, 2026 Updated 2 months ago
- PantoMatrix: Generating Face and Body Animation from Speech ☆1,255 Jan 16, 2025 Updated last year
- This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023]. ☆370 Nov 1, 2023 Updated 2 years ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021) ☆11 Nov 21, 2021 Updated 4 years ago
- Code for the paper "Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters" ☆26 Jan 7, 2025 Updated last year
- I love human motion retargeting. ☆67 Jul 1, 2025 Updated 11 months ago
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking … ☆143 Dec 5, 2023 Updated 2 years ago
- [CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion ☆142 Aug 28, 2024 Updated last year
- This is the official repository for our publication "The IVI Lab entry to the GENEA Challenge 2022 – A Tacotron2 Based Method for Co-Spee… ☆13 May 2, 2023 Updated 3 years ago
- Large Motion Model for Unified Multi-Modal Motion Generation ☆306 Dec 23, 2024 Updated last year
- ☆20 Sep 11, 2024 Updated last year
- [CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models ☆240 Mar 17, 2024 Updated 2 years ago
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer ☆168 Mar 31, 2024 Updated 2 years ago
- [CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation" ☆144 Mar 16, 2023 Updated 3 years ago
- Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity (SIGGRAPH Asia 2020) ☆275 Dec 14, 2021 Updated 4 years ago
- Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gestures Synthesis [ACMMM 2022] ☆27 Jun 26, 2025 Updated 11 months ago
- [ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation". ☆87 Jun 3, 2024 Updated 2 years ago
- This repository contains an example script to convert from a SMPL model to a bvh file. ☆228 Jun 9, 2023 Updated 3 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one ☆26 Aug 5, 2024 Updated last year
- ☆19 Jul 11, 2024 Updated last year
- The official PyTorch implementation of “BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation” (ICASSP 2025) ☆49 May 5, 2026 Updated last month
- ☆32 Feb 22, 2025 Updated last year
- Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022) ☆78 Oct 21, 2022 Updated 3 years ago
- [INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert" ☆19 Jun 25, 2025 Updated 11 months ago
- ICCV 2025 ☆71 Apr 8, 2026 Updated 2 months ago