sanjayss34 / lm-listenerView external linksLinks
Implementation for the paper "Can Language Models Learn to Listen?"
☆70Sep 2, 2023Updated 2 years ago
Alternatives and similar repositories for lm-listener
Users that are interested in lm-listener are comparing it to the libraries listed below
Sorting:
- Official pytorch implementation for Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (CVPR 2022)☆126Aug 18, 2024Updated last year
- ☆13Mar 8, 2024Updated last year
- AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents☆11Dec 4, 2023Updated 2 years ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- [TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions☆21Feb 28, 2025Updated 11 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆63Apr 23, 2025Updated 9 months ago
- ☆105Jul 5, 2023Updated 2 years ago
- The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs are General-Purpose Motion Generators"☆238Dec 28, 2023Updated 2 years ago
- ☆26Feb 12, 2024Updated 2 years ago
- Official Implementation of "Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models"☆18Sep 6, 2023Updated 2 years ago
- ☆177Feb 15, 2024Updated 2 years ago
- This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].☆366Nov 1, 2023Updated 2 years ago
- [CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation☆192Apr 30, 2024Updated last year
- DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ ent…☆205Nov 20, 2025Updated 2 months ago
- ☆133Jul 8, 2024Updated last year
- ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model☆378Apr 22, 2024Updated last year
- Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".☆300May 30, 2025Updated 8 months ago
- ☆27Aug 17, 2023Updated 2 years ago
- Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'☆201Apr 28, 2023Updated 2 years ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆51Aug 6, 2025Updated 6 months ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- Re-implementation for ICCV23 "Social Diffusion: Long-term Multiple Human Motion Anticipation"☆24Oct 3, 2023Updated 2 years ago
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆343Mar 11, 2025Updated 11 months ago
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆19Jan 17, 2026Updated last month
- MuLe: Multi-Grained Graph Learning for Multi-Behavior Recommendation (CIKM 2024)☆14Dec 21, 2024Updated last year
- (CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”☆744Sep 17, 2024Updated last year
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆26Jan 6, 2024Updated 2 years ago
- [NeurlPS-2024] The official code of MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models☆75Jan 9, 2026Updated last month
- ☆48Aug 10, 2023Updated 2 years ago
- [CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"☆144Mar 16, 2023Updated 2 years ago
- ☆200Apr 11, 2024Updated last year
- Official Pytorch implementation for "AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild…☆12Jun 26, 2025Updated 7 months ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion☆58Nov 28, 2024Updated last year
- We present a model that can generate accurate 3D sound fields of human bodies from headset microphones and body pose as inputs.☆90May 29, 2024Updated last year
- [ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".☆86Jun 3, 2024Updated last year
- This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"☆408Feb 23, 2024Updated last year
- ☆49May 20, 2024Updated last year
- Generate accompaniment part with chords using Evolutionary algorithm.☆11May 8, 2022Updated 3 years ago