athena-team / DiDiSpeechView external linksLinks
☆45Oct 24, 2020Updated 5 years ago
Alternatives and similar repositories for DiDiSpeech
Users that are interested in DiDiSpeech are comparing it to the libraries listed below
Sorting:
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- ☆10Mar 20, 2021Updated 4 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Jul 16, 2020Updated 5 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 5 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 9 months ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Oct 1, 2019Updated 6 years ago
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Oct 8, 2020Updated 5 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆87Aug 17, 2020Updated 5 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 4 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆410Apr 8, 2020Updated 5 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆32Apr 2, 2025Updated 10 months ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆124Oct 8, 2019Updated 6 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago