Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 2024)
☆75Feb 20, 2024Updated 2 years ago
Alternatives and similar repositories for FreeTalker
Users that are interested in FreeTalker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons (ACM MM 2023 Oral)☆55Jan 15, 2024Updated 2 years ago
- The ReprGesture entry to the GENEA Challenge 2022 (IMCI 2022)☆16Nov 8, 2022Updated 3 years ago
- DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ ent…☆206Nov 20, 2025Updated 4 months ago
- QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation (CVPR 2023 Highlight)☆103Oct 18, 2023Updated 2 years ago
- Scripts for numerical evaluations for the GENEA Gesture Generation Challenge☆24Nov 28, 2022Updated 3 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated last year
- ☆22Apr 17, 2024Updated last year
- Awesome Gesture Generation☆244Nov 8, 2025Updated 4 months ago
- This repository contains data pre-processing and visualization scripts used in GENEA Challenge 2022 and 2023. Check the repository's READ…☆27May 29, 2025Updated 9 months ago
- Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis☆35Apr 29, 2025Updated 10 months ago
- [CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation☆196Apr 30, 2024Updated last year
- ☆48Jun 26, 2025Updated 9 months ago
- [AAAI 2025] Official repo for paper "MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls"☆125Jan 18, 2025Updated last year
- [CVPR'2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation☆261Mar 18, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official Implementation of the Paper: Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation (ACMMM 2024)☆77May 29, 2025Updated 9 months ago
- PantoMatrix: Generating Face and Body Animation from Speech☆1,201Jan 16, 2025Updated last year
- This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].☆369Nov 1, 2023Updated 2 years ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- Code for the paper "Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters"☆25Jan 7, 2025Updated last year
- I love human motion retargeting.☆62Jul 1, 2025Updated 8 months ago
- [CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion☆137Aug 28, 2024Updated last year
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …☆144Dec 5, 2023Updated 2 years ago
- This is the official repository for our publication "The IVI Lab entry to the GENEA Challenge 2022 – A Tacotron2 Based Method for Co-Spee…☆13May 2, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Large Motion Model for Unified Multi-Modal Motion Generation☆308Dec 23, 2024Updated last year
- ☆20Sep 11, 2024Updated last year
- [CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models☆238Mar 17, 2024Updated 2 years ago
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer☆168Mar 31, 2024Updated last year
- [CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"☆144Mar 16, 2023Updated 3 years ago
- Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity (SIGGRAPH Asia 2020)☆273Dec 14, 2021Updated 4 years ago
- Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gestures Synthesis [ACMMM 2022]☆27Jun 26, 2025Updated 9 months ago
- [ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".☆87Jun 3, 2024Updated last year
- This repository contains an example script to convert from a SMPL model to a bvh file.☆223Jun 9, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆19Jul 11, 2024Updated last year
- The official Pytorch implementation of “BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation”☆52Oct 22, 2024Updated last year
- ☆33Feb 22, 2025Updated last year
- Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)☆78Oct 21, 2022Updated 3 years ago
- [INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert"☆19Jun 25, 2025Updated 9 months ago
- ICCV 2025☆62Sep 10, 2025Updated 6 months ago