CoEDL / vad-sli-asr
A pipeline to isolate and transcribe one language in mixed-language speech
☆18Updated 2 years ago
Alternatives and similar repositories for vad-sli-asr:
Users that are interested in vad-sli-asr are comparing it to the libraries listed below
- Workflow for forced alignment between languages☆18Updated last year
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- ☆25Updated 2 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- ☆24Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆32Updated last year
- ☆56Updated 2 years ago
- ☆10Updated last month
- Clustering-based methods for overlapping diarization☆80Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- A handy dataset of noises for ASR☆21Updated 5 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆49Updated 9 months ago
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 5 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last week
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆20Updated 4 months ago
- ☆12Updated 2 months ago
- An extension of PHOIBLE that includes features for allophones.☆10Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 7 months ago
- Phoneme alignment representation compatible with multiple forced aligners☆21Updated 11 months ago