mxer/awesome-speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mxer/awesome-speech)

mxer / awesome-speech

this is a treasure-house of speech

☆168

Alternatives and similar repositories for awesome-speech

Users that are interested in awesome-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
bajibabu / postfilt_gan
View on GitHub
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16Jun 27, 2018Updated 8 years ago
jpuigcerver / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆81Jun 10, 2019Updated 7 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
aishell-foundation / DaCiDian
View on GitHub
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
☆301Jun 15, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
theblackcat102 / edgedict
View on GitHub
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
☆292Aug 5, 2021Updated 4 years ago
chenzhehuai / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆13Jun 5, 2019Updated 7 years ago
speechio / BigCiDian
View on GitHub
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
☆263Oct 11, 2019Updated 6 years ago
YoavRamon / awesome-kaldi
View on GitHub
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
☆536Feb 9, 2022Updated 4 years ago
kan-bayashi / INTERSPEECH19_TUTORIAL
View on GitHub
Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago
jinserk / pytorch-asr
View on GitHub
ASR with PyTorch
☆139Mar 10, 2019Updated 7 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
speechLabBcCuny / onssen
View on GitHub
An open-source speech separation and enhancement library
☆214May 13, 2020Updated 6 years ago
espnet / interspeech2019-tutorial
View on GitHub
INTERSPEECH 2019 Tutorial Materials
☆194Mar 30, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jayneelparekh / sp2si-code
View on GitHub
Contains code for our work on speech to singing conversion (ICASSP 2020)
☆50Oct 27, 2020Updated 5 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
speechpro / mixup
View on GitHub
☆24Mar 13, 2020Updated 6 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
zerospeech / zerospeech2021_baseline
View on GitHub
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60Oct 19, 2022Updated 3 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
datemoon / ASR-decoder
View on GitHub
it's ASR decoder and make graph project
☆33May 26, 2022Updated 4 years ago
speechio / chinese_text_normalization
View on GitHub
Chinese text normalization for speech processing
☆734Mar 18, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xiangxyq / minimize-chain-decoder
View on GitHub
Minimize kaldi nnet3 chain decoder
☆45Jan 10, 2020Updated 6 years ago
for-github-backup / deprecated.github.io
View on GitHub
☆57Oct 6, 2021Updated 4 years ago
amirharati / kaldi-alligner
View on GitHub
scripts to align a given wave to its transcription using trained models by Kaldi
☆37Aug 15, 2019Updated 6 years ago
ZhengkunTian / rnn-transducer
View on GitHub
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
☆239May 12, 2020Updated 6 years ago
r9y9 / nnmnkwii_gallery
View on GitHub
A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.
☆70May 15, 2020Updated 6 years ago
Byshx / Poccala
View on GitHub
An Automatic Speech Recognition using GMM & HMM.
☆19Aug 16, 2019Updated 6 years ago
yangxueruivs / DFSMN
View on GitHub
Tensorflow version of DFSMN
☆49Jul 17, 2018Updated 8 years ago
wenet-e2e / speech-recognition-papers
View on GitHub
Towards hot directions in industrial end to end speech recognition
☆329Nov 30, 2021Updated 4 years ago
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
1ytic / warp-rnnt
View on GitHub
CUDA-Warp RNN-Transducer
☆216Feb 22, 2023Updated 3 years ago
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557Mar 12, 2026Updated 4 months ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
jzlianglu / pykaldi2
View on GitHub
Yet another speech toolkit based on Kaldi and PyTorch
☆173Jul 1, 2020Updated 6 years ago
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
tzyll / kaldi
View on GitHub
ASR cases for speech handbook at CSLT-THU, based on Kaldi toolkit and Thchs30 database, in egs/cslt_cases.
☆107Mar 12, 2021Updated 5 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago