Tonyyouyou / Mamba-in-Speech
☆16Updated 2 months ago
Related projects: ⓘ
- ConMamba for Automatic Speech Recognition☆38Updated last month
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆16Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆24Updated 3 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆18Updated 9 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆45Updated 2 months ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆33Updated 9 months ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆36Updated 2 months ago
- ☆48Updated 2 months ago
- Official repository of NeXt-TDNN for speaker verification☆48Updated 5 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆44Updated last month
- ☆41Updated last year
- ☆63Updated last year
- ☆42Updated last year
- ☆15Updated 2 years ago
- ☆19Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆40Updated this week
- ☆41Updated 2 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆66Updated 3 weeks ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆36Updated 3 weeks ago
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆32Updated last month
- ☆45Updated 7 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆29Updated 5 months ago
- ☆57Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆23Updated 2 weeks ago
- ☆27Updated 5 months ago
- A toolkit dedicate for speech evaluation.☆18Updated last month
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆32Updated 5 months ago
- ☆19Updated 7 months ago
- Learning differentiable temporal resolution on time-series data.☆33Updated last year