☆45Oct 24, 2020Updated 5 years ago
Alternatives and similar repositories for DiDiSpeech
Users that are interested in DiDiSpeech are comparing it to the libraries listed below.
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- An extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- C++ implementation of end-to-end TTS that combines Tacotron2 and the LPCNet vocoder.☆32Oct 1, 2019Updated 6 years ago
- A Python module that converts written Chinese strings to their spoken form.☆124Oct 8, 2019Updated 6 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet☆62Jun 8, 2021Updated 4 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- A PyTorch implementation of EATS: End-to-End Adversarial Text-to-Speech☆127Jul 16, 2020Updated 5 years ago
- Chinese prosody prediction model based on random forests and conditional random fields☆28Jul 25, 2024Updated last year
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Jan 26, 2021Updated 5 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)☆301Jun 15, 2020Updated 5 years ago
- A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi☆352Dec 25, 2020Updated 5 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- speech-aligner is a tool that generates phoneme-level time alignment annotations from human speech and its corresponding text.☆410Apr 8, 2020Updated 5 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆362Dec 24, 2021Updated 4 years ago
- Large, modern dataset for speech recognition☆721Feb 26, 2024Updated 2 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆44Dec 17, 2020Updated 5 years ago
- Ossian: A simple language-independent Text-to-speech frontend☆17Mar 1, 2018Updated 8 years ago
- Chinese text normalization for speech processing☆722Mar 18, 2023Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 5 years ago
- ☆10Mar 20, 2021Updated 5 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago