plnguyen2908 / LASER_ASDLinks
☆12Updated last month
Alternatives and similar repositories for LASER_ASD
Users that are interested in LASER_ASD are comparing it to the libraries listed below
Sorting:
- ☆50Updated 2 weeks ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆36Updated 2 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- ☆17Updated 2 years ago
- The project page repo for Neural Dubber.☆30Updated last year
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆109Updated 3 years ago
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Updated last year
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆45Updated 11 months ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆24Updated 2 years ago
- SyncNet for Time Synchronization☆27Updated 2 years ago
- ☆20Updated 3 years ago
- The ReprGesture entry to the GENEA Challenge 2022 (IMCI 2022)☆16Updated 2 years ago
- Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 202…☆70Updated last year
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- The repository for Springer IJCV 2025 (LR-ASD: Lightweight and Robust Network for Active Speaker Detection)☆43Updated 3 months ago
- Code for the paper Real-Time Neural Voice Camouflage☆28Updated 3 years ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).☆25Updated last year
- ☆10Updated this week
- ☆65Updated 2 years ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆86Updated 9 months ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32Updated 6 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆65Updated 4 years ago
- SGToolkit: An Interactive Gesture Authoring Toolkit for Embodied Conversational Agents (UIST 2021)☆44Updated 2 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆52Updated 2 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆24Updated last year
- ☆23Updated last year
- ☆16Updated 2 years ago
- ☆19Updated 6 months ago
- ☆9Updated 2 years ago