Mddct/WeUSM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Mddct/WeUSM)

Mddct / WeUSM

☆13

Alternatives and similar repositories for WeUSM

Users that are interested in WeUSM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenet-e2e / WeSpeech-AI
View on GitHub
Open Source Speech/Text Data on AI
☆19Sep 13, 2022Updated 3 years ago
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
wenet-e2e / wesignal
View on GitHub
Production first, nn-based on-device signal processing toolkit.
☆63May 30, 2023Updated 3 years ago
pengzhendong / welm
View on GitHub
One command to build TLG.fst for WeNet.
☆30Oct 11, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Slyne / ctc_decoder
View on GitHub
A ctc decoder for both online and offline asr model
☆66Nov 18, 2023Updated 2 years ago
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
pengzhendong / g2p-mix
View on GitHub
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆115Dec 2, 2025Updated 7 months ago
thuhcsi / Contextual-Biasing-Dataset
View on GitHub
open-source Mandarian biased word dataset
☆14Sep 21, 2023Updated 2 years ago
wenet-e2e / speech-recognition-papers
View on GitHub
Towards hot directions in industrial end to end speech recognition
☆329Nov 30, 2021Updated 4 years ago
kyegomez / MELLE
View on GitHub
An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"
☆16Updated this week
pengzhendong / audiolab
View on GitHub
A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)
☆39Mar 31, 2026Updated 3 months ago
placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
pengzhendong / compute-wer
View on GitHub
Compute WER and SER for speech recognition evaluation
☆27Jun 6, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lovemefan / fsmn-vad
View on GitHub
A enterprise-grade Voice Activity Detector from modelscope and funasr.
☆139Apr 26, 2023Updated 3 years ago
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago
Kirili4ik / kws-attention-pytorch
View on GitHub
Keyword spotting for audio with attention (KWS model for audio)
☆18Jul 15, 2021Updated 5 years ago
robin1001 / vad
View on GitHub
simple energy vad
☆19Jun 3, 2017Updated 9 years ago
NKU-HLT / KNN-CTC
View on GitHub
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆42Mar 20, 2024Updated 2 years ago
xiangxyq / minimize-chain-decoder
View on GitHub
Minimize kaldi nnet3 chain decoder
☆45Jan 10, 2020Updated 6 years ago
XiaoMi / dasheng
View on GitHub
Official PyTorch code for Deep Audio-Signal Holistic Embeddings
☆199Nov 7, 2025Updated 8 months ago
tencent-ailab / 3m-asr
View on GitHub
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆119Jun 22, 2022Updated 4 years ago
Mddct / cosyvoice2-flow-optimized
View on GitHub
faster inference
☆27Jan 20, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
bfs18 / e2_tts
View on GitHub
☆70Sep 3, 2024Updated last year
DataXujing / ASR-paper
View on GitHub
ASR教程: https://dataxujing.github.io/ASR-paper/
☆26Jul 1, 2024Updated 2 years ago
danpovey / conditional-flow-matching
View on GitHub
☆29Aug 8, 2024Updated last year
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago
xingchensong / TouchNet
View on GitHub
A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
☆232Jul 2, 2026Updated 2 weeks ago
wenet-e2e / wecut
View on GitHub
video cut powered by AI
☆23Nov 15, 2022Updated 3 years ago
wenet-e2e / wenet_in_action_homework
View on GitHub
WeNet 实战课程作业
☆21Oct 7, 2022Updated 3 years ago
stevenhillis / awesome-asr-contextualization
View on GitHub
A curated list of awesome papers on contextualizing E2E ASR outputs
☆81May 10, 2023Updated 3 years ago
gkchai / SpeechToText
View on GitHub
Bi-directional streaming speech-to-text service using Cloud ASRs
☆15Aug 23, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jymh / SAP2-ASR
View on GitHub
☆26Jan 23, 2026Updated 5 months ago
MrSupW / ICMC-ASR_Baseline
View on GitHub
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
☆57Dec 6, 2023Updated 2 years ago
jonflynng / qwen2-audio-finetune
View on GitHub
Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.
☆24Nov 23, 2024Updated last year
skit-ai / SpeechLLM
View on GitHub
This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…
☆137Jun 25, 2024Updated 2 years ago
xingchensong / FlashCosyVoice
View on GitHub
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250Feb 25, 2026Updated 4 months ago
wenet-e2e / WeTextProcessing
View on GitHub
Text Normalization & Inverse Text Normalization
☆802Jun 26, 2026Updated 3 weeks ago
mavericksgeek / Multi-objective-GA-BandClassification
View on GitHub
Band selection and classification of hyperspectral images using Multi-objective Genetic Algorithms
☆14Jan 18, 2019Updated 7 years ago