leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
C++ version of pyannote audio overlapped speech detection pipeline
☆9 Updated 9 months ago
Related projects
Alternatives and complementary repositories for pyannote-audio_overlapped-speech-detection_cpp
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System". ☆18 Updated 2 months ago
- ☆11 Updated 3 years ago
- A simple command line tool to calculate WER for ASR. ☆13 Updated last month
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆13 Updated last month
- ☆10 Updated last year
- ☆17 Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆12 Updated 2 years ago
- Prosodic Speech Segmentation with Transformers ☆23 Updated 8 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆18 Updated 8 months ago
- A handy dataset of noises for ASR ☆19 Updated 5 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆39 Updated 3 months ago
- ☆17 Updated 3 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :) ☆25 Updated 4 months ago
- Pronunciation-assisted Subword Modeling ☆29 Updated 5 years ago
- ☆16 Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆13 Updated last month
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation ☆17 Updated 3 months ago
- ☆25 Updated 3 weeks ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for… ☆9 Updated 2 years ago
- ☆10 Updated last year
- Open Source Speech/Text Data on AI ☆18 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated 9 months ago
- source code of EfficientTTS 2 ☆12 Updated 9 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task. ☆21 Updated 2 months ago
- End-to-end diarization loss ☆22 Updated 3 years ago
- A SPMI Lab toolkit for language models. ☆11 Updated 7 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 Updated 2 years ago