shiyuzh2007 / ASR
☆55Updated 4 years ago
Alternatives and similar repositories for ASR:
Users that are interested in ASR are comparing it to the libraries listed below
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆75Updated 3 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 4 years ago
- Recurrent Neural Aligner☆49Updated 4 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆22Updated 4 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- ☆75Updated 2 years ago
- Region proposal network based small-footprint keyword spotting (Pytorch)☆54Updated last year
- mWER loss implementation in tensorflow☆31Updated 4 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 5 years ago
- ☆61Updated 2 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆59Updated 4 years ago
- A python IO interface for data accessing in kaldi☆38Updated 3 years ago
- ☆106Updated 3 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Updated 2 years ago
- Memory efficient transducer loss computation☆68Updated 2 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆71Updated 6 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆137Updated 3 years ago
- ASR for Chinese Mandarin☆75Updated 6 years ago
- Custom decoders for Kaldi☆79Updated 5 years ago
- ☆41Updated 6 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 5 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- it's ASR decoder and make graph project☆32Updated 2 years ago
- it's a train acoustics model code lib☆26Updated 4 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago