MontrealCorpusTools / kalpy
Pybind11 bindings for Kaldi
☆12Updated 6 months ago
Alternatives and similar repositories for kalpy:
Users that are interested in kalpy are comparing it to the libraries listed below
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆30Updated 2 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆27Updated 7 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- ☆18Updated 7 months ago
- ☆30Updated 2 years ago
- ☆26Updated 2 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆19Updated 7 months ago
- This is the project page of our paper "MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion".☆9Updated last month
- ☆51Updated 5 months ago
- Objective metrics used in several text-to-speech (TTS) papers.☆48Updated 3 years ago
- ☆23Updated 10 months ago
- ☆28Updated 11 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 2 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated 3 weeks ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- ☆10Updated 4 months ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆12Updated 2 months ago
- ☆16Updated 3 months ago
- ☆25Updated 8 months ago
- ☆16Updated 7 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆42Updated last year
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆33Updated 4 years ago
- multilingual speech aligner☆74Updated last year
- A simple command line tool to calculate WER for ASR.☆14Updated 6 months ago
- ☆62Updated 11 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- ☆10Updated 2 years ago
- ☆65Updated last year
- ☆29Updated 3 years ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆74Updated last year