zju3dv/StreamingTalker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zju3dv/StreamingTalker)

zju3dv / StreamingTalker

Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral

☆55

Alternatives and similar repositories for StreamingTalker

Users that are interested in StreamingTalker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zju3dv / PointSplat
View on GitHub
[ECCV 2026] PointSplat: Compact Gaussian Splatting via Human-Centric Prediction
☆45Jul 14, 2026Updated last week
m-hamza-mughal / miburi
View on GitHub
Implementation for MIBURI: Towards Expressive Interactive Gesture Synthesis (CVPR 2026)
☆29Jul 15, 2026Updated last week
RobinWitch / DyStream
View on GitHub
☆37Feb 7, 2026Updated 5 months ago
JaesungHuh / ca-subtitle
View on GitHub
Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"
☆21Nov 3, 2025Updated 8 months ago
Dogter521 / LSF-Animation
View on GitHub
Official repository of Siggraph Asia 2025 paper "LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representa…
☆26Dec 24, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xg-chu / UniLS
View on GitHub
[CVPR 2026] UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking
☆48Apr 20, 2026Updated 3 months ago
xg-chu / ARTalk
View on GitHub
ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.
☆136May 19, 2026Updated 2 months ago
zju3dv / MotionStreamer
View on GitHub
[ICCV 2025] MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
☆287Oct 28, 2025Updated 8 months ago
zju3dv / EgoAgent
View on GitHub
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆53Jun 30, 2025Updated last year
ubisoft / ubisoft-laforge-msmd
View on GitHub
Model See Model Do: Speech-Driven Facial Animation with Style Control
☆27May 6, 2025Updated last year
CoderChen01 / towards-seamless-interaction
View on GitHub
Official repository of the paper "Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics…
☆20Mar 12, 2026Updated 4 months ago
RobinWitch / MECo
View on GitHub
Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…
☆32Mar 29, 2026Updated 3 months ago
kimhyungkyu-1208 / MemoryTalker
View on GitHub
☆17Dec 1, 2025Updated 7 months ago
AlayaLab / FloodDiffusion
View on GitHub
FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
☆97Mar 20, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zengchang233 / CrossSinger
View on GitHub
The source code for the paper CrossSinger (asru2023)
☆18Oct 12, 2023Updated 2 years ago
ArrayDPS / ArrayDPS
View on GitHub
☆40May 12, 2025Updated last year
MCG-NJU / Video-DC
View on GitHub
☆12Jul 30, 2025Updated 11 months ago
WANGSSSSSSS / GS2d_Triton
View on GitHub
Gaussian Splating 2d implemented in triton
☆12Mar 19, 2024Updated 2 years ago
whwjdqls / DEEPTalk
View on GitHub
Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025]
☆66Feb 13, 2025Updated last year
jasongzy / Make-It-Poseable
View on GitHub
☆28May 13, 2026Updated 2 months ago
fudan-generative-vision / hallo4
View on GitHub
[SIGGRAPH Asia 2025] Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
☆38Nov 30, 2025Updated 7 months ago
facebookresearch / seamless_interaction
View on GitHub
Foundation Models and Data for Human-Human and Human-AI interactions.
☆400Dec 13, 2025Updated 7 months ago
wsj-sjtu / MMHead
View on GitHub
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation (ACM MM 2024)
☆42Feb 1, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
showlab / Multi-human-Talking-Video-Dataset
View on GitHub
Muti-human Interactive Talking Dataset
☆75Aug 6, 2025Updated 11 months ago
ZiqiaoPeng / DualTalk
View on GitHub
[CVPR 2025] This is the official source for our paper "DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations"
☆65Jul 12, 2025Updated last year
miccunifi / ScanTalk
View on GitHub
[ECCV 2024] - ScanTalk: 3D Talking Heads from Unregistered Scans
☆55Jan 20, 2026Updated 6 months ago
hmartelb / avlit
View on GitHub
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20Sep 1, 2023Updated 2 years ago
dadwadw233 / VibePortrait
View on GitHub
🎭 Know yourself as a developer. One command → AI analyzes your coding history → beautiful personality portrait + persona skill. Works wi…
☆25Apr 8, 2026Updated 3 months ago
zju3dv / BoxDreamer
View on GitHub
Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.
☆108Oct 6, 2025Updated 9 months ago
facebookresearch / av_flow
View on GitHub
code for the paper AV-Flow Transforming Text to Audio-Visual Human-like Interactions
☆25Nov 16, 2025Updated 8 months ago
yfyeung / CLSP
View on GitHub
[ACL 2026 Main] Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training
☆104Apr 6, 2026Updated 3 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
facebookresearch / SS2_HRTF
View on GitHub
SS2 HRTF Dataset - Reality Labs Research Audio
☆18May 22, 2026Updated 2 months ago
stepfun-ai / StepAudio-Skills
View on GitHub
Audio skills for Claw
☆27Apr 16, 2026Updated 3 months ago
yz-cnsdqz / primal-release
View on GitHub
official implementation of [PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning, ICCV'25]
☆38Oct 31, 2025Updated 8 months ago
MRSAudio / MRSAudio_Main
View on GitHub
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43Oct 15, 2025Updated 9 months ago
EngineeringAI-LAB / 3DXTalker
View on GitHub
Official repository for 3DXTalker: An Integrated Framework for Expressive 3D Talking Avatars
☆16Apr 6, 2026Updated 3 months ago
liutaocode / talking-face-arxiv-daily
View on GitHub
🎓 Update Talking-Face Research Papers Daily
☆459Updated this week
chenhaoqcdyq / lmr-codes
View on GitHub
Think Before You Move: Latent Motion Reasoning for Text-to-Motion Generation
☆18Jan 4, 2026Updated 6 months ago