didi / athena
A release version for https://github.com/athena-team/athena
☆126Updated 2 years ago
Alternatives and similar repositories for athena:
Users who are interested in athena are comparing it to the libraries listed below:
- ASR for Chinese Mandarin☆75Updated 6 years ago
- An acoustic model built with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture.☆71Updated 6 years ago
- Kaldi model converter to ONNX☆241Updated 2 years ago
- A pytorch based end2end speech recognition system.☆113Updated 4 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- ☆142Updated 4 years ago
- this is a treasure-house of speech☆164Updated 6 years ago
- ☆61Updated 2 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆338Updated 4 years ago
- DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)☆301Updated 4 years ago
- A simple model implemented with tensorflow for voiceprint☆87Updated 6 years ago
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated last year
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆197Updated 2 weeks ago
- Mandarin ASR system based on tensorflow☆108Updated 6 years ago
- ☆106Updated 4 years ago
- A Python module that converts written Chinese strings to their spoken form.☆119Updated 5 years ago
- Speaker recognition based on d-vector, implemented in Keras☆88Updated 4 years ago
- Tools for ASR Corpus Generation from Online Video☆140Updated 6 years ago
- The code for aishell-3 baseline acoustic model☆67Updated 4 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆256Updated 5 years ago
- chinese tts☆74Updated 4 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- A CTC decoder for both online and offline ASR models☆63Updated last year
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆76Updated 4 years ago
- Simple DNN-based VAD☆70Updated 6 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Updated 3 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Updated 2 years ago
- A Demo of Mandarin/Chinese TTS frontend☆278Updated 2 years ago
- ☆121Updated 3 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago