sanjayss34/lm-listener

Implementation for the paper "Can Language Models Learn to Listen?"

☆71

Alternatives and similar repositories for lm-listener

Users that are interested in lm-listener are comparing it to the libraries listed below.

evonneng / learning2listen
Official pytorch implementation for Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (CVPR 2022)
☆128 · Aug 18, 2024 · Updated last year
reactmultimodalchallenge / baseline_react2024
☆16 · Mar 8, 2024 · Updated 2 years ago
Dorniwang / AgentAvatar
AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents
☆11 · Dec 4, 2023 · Updated 2 years ago
scottgeng00 / realtalk
The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."
☆15 · Jan 27, 2023 · Updated 3 years ago
Boese0601 / Dyadic-Interaction-Modeling
[ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation
☆65 · Apr 23, 2025 · Updated last year
lingjivoo / ReactFace
[TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions
☆23 · Feb 28, 2025 · Updated last year
dc3ea9f / vico_challenge_baseline
☆105 · Jul 5, 2023 · Updated 3 years ago
qiqiApink / MotionGPT
The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs are General-Purpose Motion Generators"
☆238 · Dec 28, 2023 · Updated 2 years ago
reactmultimodalchallenge / baseline_react2023
☆26 · Feb 12, 2024 · Updated 2 years ago
wuhaozhe / audio2face_mm2023
☆48 · Aug 10, 2023 · Updated 2 years ago
uuembodiedsocialai / FaceDiffuser
☆182 · Feb 15, 2024 · Updated 2 years ago
yhw-yhw / TalkSHOW
This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].
☆371 · Nov 1, 2023 · Updated 2 years ago
JeremyCJM / DiffSHEG
[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
☆207 · Apr 30, 2024 · Updated 2 years ago
tanshuai0219 / style2talker
[AAAI 2024] style2talker - Official PyTorch Implementation
☆54 · Aug 6, 2025 · Updated 11 months ago
geonwooko / MULE
MuLe: Multi-Grained Graph Learning for Multi-Behavior Recommendation (CIKM 2024)
☆14 · Dec 21, 2024 · Updated last year
LTT-O / Awesome-Talking-Head-Generation
Something about Talking Head Generation
☆31 · Sep 5, 2023 · Updated 2 years ago
YoungSeng / DiffuseStyleGesture
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ ent…
☆214 · Apr 9, 2026 · Updated 3 months ago
MotrixLab / ReMoDiffuse
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
☆379 · Apr 22, 2024 · Updated 2 years ago
magic-research / dream-talk
☆16 · Jan 8, 2024 · Updated 2 years ago
jixinya / EAMM
Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'
☆201 · Apr 28, 2023 · Updated 3 years ago
tasyiann / 2Dto3DMotion
A repo with Unity3D inspector tools, using OpenPose to predict 3D Character animation motion from 2D figures.
☆10 · Dec 17, 2021 · Updated 4 years ago
antonibigata / Laughing-Matters
Official Implementation of "Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models"
☆18 · Sep 6, 2023 · Updated 2 years ago
WikiChao / Ego-AV-Loc
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27 · Jan 6, 2024 · Updated 2 years ago
radekd91 / emoca
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image…
☆850 · Dec 6, 2024 · Updated last year
thuhcsi / S2G-MDDiffusion
☆134 · Jul 8, 2024 · Updated 2 years ago
alvinliu0 / HA2G
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
☆144 · Mar 16, 2023 · Updated 3 years ago
ZoneLikeWonderland / HACK-Model
☆156 · Apr 23, 2023 · Updated 3 years ago
yuangan / EAT_code
Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".
☆300 · Mar 4, 2026 · Updated 4 months ago
Mael-zys / T2M-GPT
(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
☆769 · Sep 17, 2024 · Updated last year
psyai-net / EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
☆420 · Feb 23, 2024 · Updated 2 years ago
ku-vai / TPoS
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25 · Dec 7, 2023 · Updated 2 years ago
zyhbili / LivelySpeaker
[ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".
☆87 · Jun 3, 2024 · Updated 2 years ago
stoneMo / OneAVM
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12 · Jun 1, 2023 · Updated 3 years ago
DiffPoseTalk / DiffPoseTalk
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
☆355 · Mar 11, 2025 · Updated last year
Dorniwang / PD-FGC-inference
This is official inference code of PD-FGC
☆101 · Oct 15, 2023 · Updated 2 years ago
jixinya / EVP
Code for paper 'Audio-Driven Emotional Video Portraits'.
☆314 · Mar 16, 2022 · Updated 4 years ago
bala1144 / Imitator
☆202 · Apr 11, 2024 · Updated 2 years ago
hzwer / MM2022-ViCoPerceptualHeadGeneration
MM2022 Workshop-Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer
☆55 · May 16, 2024 · Updated 2 years ago
johndpope / IMF
Implicit Motion Function - (unofficial) Microsoft recreation
☆30 · Nov 19, 2024 · Updated last year