vadimkantorov / convasrLinks
Baseline convolutional ASR system in PyTorch
☆21Updated last year
Alternatives and similar repositories for convasr
Users that are interested in convasr are comparing it to the libraries listed below
Sorting:
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- ☆21Updated 6 years ago
- Smart Language Model☆46Updated 2 years ago
- Experiments with grapheme2phoneme for Russian based on the artificial neural networks☆20Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆42Updated 3 years ago
- ID R&D Voice Antispoofing Challenge Solution☆11Updated 6 years ago
- ☆13Updated 2 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 4 years ago
- ☆12Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year
- Accentor and transcriptor for Russian language☆127Updated 3 years ago
- ☆13Updated 4 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- Python клиент API распознавания и синтеза речи Обл ака ЦРТ☆11Updated 2 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- ☆21Updated 7 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Updated 4 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Updated 3 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Updated 7 years ago
- ☆37Updated 5 months ago
- ☆56Updated 2 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated 2 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- Multistream CNN for Robust Acoustic Modeling☆40Updated 4 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- ☆24Updated 5 years ago
- Convert words to numbers☆21Updated 3 years ago
- Grapheme to phoneme model for PyTorch☆41Updated 3 years ago