MontrealCorpusTools / kalpy
Pybind11 bindings for Kaldi
☆12 Updated 4 months ago
Alternatives and similar repositories for kalpy:
Users who are interested in kalpy are comparing it to the libraries listed below.
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆27 Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆16 Updated 4 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆24 Updated 5 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆62 Updated 11 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆34 Updated 7 months ago
- multilingual speech aligner ☆72 Updated last year
- ☆31 Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Updated 4 months ago
- ☆48 Updated 3 months ago
- ☆27 Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆40 Updated last year
- Just another FastSpeech 2 but cleaner code :) ☆26 Updated 7 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization ☆30 Updated 4 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆42 Updated 4 months ago
- ☆18 Updated 5 months ago
- Pronunciation-assisted Subword Modeling ☆29 Updated 5 years ago
- Objective metrics used in several text-to-speech (TTS) papers. ☆48 Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆50 Updated 6 months ago
- ☆36 Updated 5 months ago
- ☆26 Updated 9 months ago
- ☆25 Updated 6 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition. ☆12 Updated last month
- ☆16 Updated 2 years ago
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper ☆72 Updated 9 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆18 Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆52 Updated 3 months ago
- ☆40 Updated 3 years ago
- ☆62 Updated 9 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor ☆18 Updated last year
- ☆21 Updated 2 weeks ago