wenet-e2e/WeSpeech-AI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wenet-e2e/WeSpeech-AI)

wenet-e2e / WeSpeech-AI

Open Source Speech/Text Data on AI

☆19

Alternatives and similar repositories for WeSpeech-AI

Users that are interested in WeSpeech-AI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
robin1001 / vad
View on GitHub
simple energy vad
☆19Jun 3, 2017Updated 9 years ago
wenet-e2e / wecut
View on GitHub
video cut powered by AI
☆23Nov 15, 2022Updated 3 years ago
Mddct / WeUSM
View on GitHub
☆13Mar 30, 2023Updated 3 years ago
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
patrickvonplaten / audio-gen-dreambooth
View on GitHub
☆23Jun 13, 2023Updated 3 years ago
k2-fsa / multi_quantization
View on GitHub
☆46Nov 2, 2023Updated 2 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
huismiling / wenet_trt8
View on GitHub
☆75Jun 27, 2022Updated 4 years ago
lmxue / ICASSP2022_TTS_VC_Summary
View on GitHub
ICASSP2022 TTS&VC Summary
☆13Jun 9, 2022Updated 4 years ago
pengzhendong / audiolab
View on GitHub
A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)
☆39Mar 31, 2026Updated 3 months ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
pengzhendong / torchfa
View on GitHub
Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆61Sep 5, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xcmyz / CLONE
View on GitHub
☆20Jul 13, 2022Updated 4 years ago
insunhwang89 / StyleVC
View on GitHub
☆33Jan 14, 2023Updated 3 years ago
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 3 months ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
azraelkuan / repgan
View on GitHub
RepVgg + HiFiGAN
☆36Aug 10, 2022Updated 3 years ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Jackson-Kang / Prosody-augmentation-for-Text-to-speech
View on GitHub
Simple tool for speech dataset augmentation for modeling various prosodies.
☆14Jan 14, 2021Updated 5 years ago
pengzhendong / welm
View on GitHub
One command to build TLG.fst for WeNet.
☆30Oct 11, 2022Updated 3 years ago
tts-tutorial / book
View on GitHub
☆63Sep 18, 2022Updated 3 years ago
pengzhendong / g2p-mix
View on GitHub
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆115Dec 2, 2025Updated 7 months ago
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
wenet-e2e / wesignal
View on GitHub
Production first, nn-based on-device signal processing toolkit.
☆63May 30, 2023Updated 3 years ago
So-Fann / VISinger
View on GitHub
☆55Aug 11, 2022Updated 3 years ago
JSALT-2022-SSL / superb-prosody
View on GitHub
☆31Jul 13, 2023Updated 3 years ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
wenet-e2e / WeTextProcessing
View on GitHub
Text Normalization & Inverse Text Normalization
☆802Updated this week
ga642381 / Speech-Prompts-Adapters
View on GitHub
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
☆113Aug 4, 2023Updated 2 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
imdanboy / jets
View on GitHub
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
☆112Jun 6, 2022Updated 4 years ago
keonlee9420 / WaveGrad2
View on GitHub
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
☆68Aug 3, 2021Updated 4 years ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago