This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
☆40Feb 10, 2018Updated 8 years ago
Alternatives and similar repositories for pytorch_MLP_for_ASR
Users that are interested in pytorch_MLP_for_ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c…☆34Apr 15, 2018Updated 7 years ago
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Learning-Recurrent-Binary-Ternary-Weights☆13Dec 4, 2018Updated 7 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆97May 30, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆16May 25, 2019Updated 6 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Feb 19, 2018Updated 8 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research☆15Aug 28, 2015Updated 10 years ago
- PyTorch bindings for Warp-CTC☆42Dec 6, 2019Updated 6 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Oct 28, 2016Updated 9 years ago
- ☆13Sep 12, 2017Updated 8 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- speech-aligner,是一个从“人声语音”及其“语 言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- Source data, scripts and makefiles of the experiment for the Speex codec quality evaluation☆22Aug 29, 2011Updated 14 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Dec 30, 2019Updated 6 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆42Jun 25, 2018Updated 7 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks☆18Nov 5, 2019Updated 6 years ago
- 语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总☆23Nov 8, 2019Updated 6 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,397Mar 14, 2022Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Keyword Spotting suitable for embedded devices.☆28Jun 22, 2020Updated 5 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repository can be used to perform Speech to Text Conversion in multiple Languages, e.g., It can convert whatever you are speaking in…☆11Oct 6, 2020Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- Implementations for master thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB.☆16Apr 4, 2019Updated 6 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆47Jun 24, 2020Updated 5 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆17May 14, 2022Updated 3 years ago
- Collection of machine learning demos for Automatic Speech Recognition☆55Sep 24, 2021Updated 4 years ago