athena-team/DiDiSpeech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/athena-team/DiDiSpeech)

athena-team / DiDiSpeech

☆45

Alternatives and similar repositories for DiDiSpeech

Users that are interested in DiDiSpeech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rishikksh20 / PPSpeech
View on GitHub
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35Aug 31, 2020Updated 5 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
npuichigo / tarzan
View on GitHub
High-level API for tar-based dataset
☆12Feb 3, 2024Updated 2 years ago
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
li1jkdaw / LPCNet_parallel
View on GitHub
Simulation of parallel synthesis with LPCNet vocoder
☆14May 5, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
thuhcsi / NeuFA
View on GitHub
Neural network-based forced alignment with bidirectional attention mechanism
☆78Jan 17, 2025Updated last year
alokprasad / lpctron-tts-cpp
View on GitHub
C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.
☆32Oct 1, 2019Updated 6 years ago
open-speech / cn-text-normalizer
View on GitHub
A python module that convert chinese written string to read string. 一个python包：将中文书面字符串转换为口语字符串。
☆124Oct 8, 2019Updated 6 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
View on GitHub
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Jan 27, 2020Updated 6 years ago
danijel3 / SparrowhawkTest
View on GitHub
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14Oct 16, 2017Updated 8 years ago
Yablon / auorange
View on GitHub
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
☆62Jun 8, 2021Updated 5 years ago
alumae / streaming-punctuator
View on GitHub
☆17Apr 14, 2023Updated 3 years ago
yanggeng1995 / EATS
View on GitHub
A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech
☆127Jul 16, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Riroaki / Chinese-Rhythm-Predictor
View on GitHub
基于随机森林和条件随机场的中文韵律预测模型
☆28Jul 25, 2024Updated 2 years ago
espnet / icassp2020-tts
View on GitHub
ESPnet-TTS Audio Sample HP
☆21Oct 25, 2019Updated 6 years ago
r9y9 / kiritan_singing
View on GitHub
Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.
☆28Dec 31, 2023Updated 2 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
aishell-foundation / DaCiDian
View on GitHub
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
☆301Jun 15, 2020Updated 6 years ago
tencent-ailab / pika
View on GitHub
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
☆354Dec 25, 2020Updated 5 years ago
xuchenglin28 / target_speaker_verification
View on GitHub
target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech
☆15Jan 26, 2021Updated 5 years ago
athena-team / athena-transform
View on GitHub
☆21Jan 13, 2020Updated 6 years ago
thu-spmi / SPMILM
View on GitHub
A SPMI Lab toolkit for language models.
☆11Apr 12, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yanggeng1995 / Multi-band-WaveRNN
View on GitHub
☆45Dec 16, 2019Updated 6 years ago
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago
MU94W / TTS-Eval
View on GitHub
☆18Aug 9, 2018Updated 7 years ago
open-speech / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆410Apr 8, 2020Updated 6 years ago
kakaobrain / g2pm
View on GitHub
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆367Dec 24, 2021Updated 4 years ago
mjansche / tts-tutorial
View on GitHub
Text-to-Speech tutorial at SLTU 2016
☆35May 10, 2016Updated 10 years ago
SpeechColab / GigaSpeech
View on GitHub
Large, modern dataset for speech recognition
☆731Feb 26, 2024Updated 2 years ago
espnet / espnet_tts_frontend
View on GitHub
Text frontend for ESPnet tts recipes
☆35Jun 1, 2021Updated 5 years ago
LianjiaTech / athena
View on GitHub
An open-source implementation of sequence-to-sequence based speech processing engine
☆39Jan 11, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chenzhehuai / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆13Jun 5, 2019Updated 7 years ago
keonlee9420 / Parallel-Tacotron2
View on GitHub
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191Nov 18, 2021Updated 4 years ago
candlewill / Ossian
View on GitHub
Ossian: A simple language-independent Text-to-speech frontend
☆17Mar 1, 2018Updated 8 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
zzw922cn / LPC_for_TTS
View on GitHub
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.
☆72Mar 19, 2021Updated 5 years ago
speechio / chinese_text_normalization
View on GitHub
Chinese text normalization for speech processing
☆735Mar 18, 2023Updated 3 years ago
coryshain / dnnseg
View on GitHub
☆11Mar 20, 2021Updated 5 years ago