This code implements a basic multilayer perceptron (MLP) for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
☆40 · Feb 10, 2018 · Updated 8 years ago
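The description above mentions an MLP with dropout and batch normalization, trained in PyTorch on features and alignments produced elsewhere. The following is a minimal sketch of what such a model could look like, not the repository's actual code: the class name `MLP`, the layer sizes, the dropout rate, and the random stand-in data are all assumptions chosen for illustration. In a real setup the inputs would be acoustic feature frames and the targets would be frame-level labels taken from Kaldi alignments.

```python
import torch
import torch.nn as nn


class MLP(nn.Module):
    """Hypothetical feed-forward acoustic model: feature frames in, log-posteriors out."""

    def __init__(self, input_dim, hidden_dim, num_layers, num_classes, dropout=0.2):
        super().__init__()
        layers = []
        in_dim = input_dim
        for _ in range(num_layers):
            layers += [
                nn.Linear(in_dim, hidden_dim),
                nn.BatchNorm1d(hidden_dim),  # batch normalization over the hidden units
                nn.ReLU(),
                nn.Dropout(p=dropout),       # dropout is only active in training mode
            ]
            in_dim = hidden_dim
        self.hidden = nn.Sequential(*layers)
        self.output = nn.Linear(in_dim, num_classes)

    def forward(self, x):
        # Log-softmax over classes, so the output pairs with NLLLoss.
        return torch.log_softmax(self.output(self.hidden(x)), dim=1)


torch.manual_seed(0)

# All sizes below are assumed values, not taken from the repository.
model = MLP(input_dim=440, hidden_dim=256, num_layers=2, num_classes=48)

# Random stand-ins for a mini-batch of feature frames and their frame labels.
features = torch.randn(8, 440)
targets = torch.randint(0, 48, (8,))

# One training step: forward pass, loss, backward pass, parameter update.
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss = nn.NLLLoss()(model(features), targets)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

Calling `model.eval()` before inference switches dropout off and makes batch normalization use its running statistics, which matters when the log-posteriors are handed to an external decoder.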
Alternatives and similar repositories for pytorch_MLP_for_ASR
Users that are interested in pytorch_MLP_for_ASR are comparing it to the libraries listed below.
- THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c… ☆34 · Apr 15, 2018 · Updated 7 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- Learning-Recurrent-Binary-Ternary-Weights ☆13 · Dec 4, 2018 · Updated 7 years ago
- This Python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a… ☆97 · May 30, 2020 · Updated 5 years ago
- Baseline Kaldi script for UA-SPEECH corpus ☆32 · Oct 16, 2024 · Updated last year
- ☆16 · May 25, 2019 · Updated 6 years ago
- ☆24 · Sep 25, 2018 · Updated 7 years ago
- Python implementation of pre-processing for End-to-End speech recognition ☆69 · Feb 19, 2018 · Updated 8 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm) ☆27 · Mar 1, 2017 · Updated 9 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research ☆15 · Aug 28, 2015 · Updated 10 years ago
- PyTorch bindings for Warp-CTC ☆42 · Dec 6, 2019 · Updated 6 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in MATLAB. ☆43 · Oct 28, 2016 · Updated 9 years ago
- Custom decoders for Kaldi ☆13 · Jun 5, 2019 · Updated 6 years ago
- ☆13 · Sep 12, 2017 · Updated 8 years ago
- Implements an end-to-end ASR algorithm with TensorFlow ☆40 · Aug 23, 2018 · Updated 7 years ago
- An ASR decoder and make-graph project ☆33 · May 26, 2022 · Updated 3 years ago
- Tensor2tensor experiment with SpecAugment ☆46 · May 13, 2019 · Updated 6 years ago
- Source data, scripts and makefiles of the experiment for the Speex codec quality evaluation ☆22 · Aug 29, 2011 · Updated 14 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models. ☆20 · Dec 30, 2019 · Updated 6 years ago
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- ☆42 · Jun 25, 2018 · Updated 7 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining ☆12 · Mar 23, 2021 · Updated 5 years ago
- Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks ☆18 · Nov 5, 2019 · Updated 6 years ago
- A collection of resources on speech technologies: speech recognition, speech front-end processing, speech synthesis, voice conversion, and more ☆23 · Nov 8, 2019 · Updated 6 years ago
- A SPMI Lab toolkit for language models. ☆11 · Apr 12, 2017 · Updated 8 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,… ☆2,396 · Mar 14, 2022 · Updated 4 years ago
- Keyword Spotting suitable for embedded devices. ☆28 · Jun 22, 2020 · Updated 5 years ago
- SWIG bindings for Kaldi I/O, built with Conda ☆15 · Dec 15, 2024 · Updated last year
- Python wrapper for hts engine ☆14 · Jan 30, 2018 · Updated 8 years ago
- Listen, Attend and Spell - PyTorch Implementation ☆17 · Dec 28, 2018 · Updated 7 years ago
- This repository can be used to perform Speech to Text Conversion in multiple Languages, e.g., It can convert whatever you are speaking in… ☆11 · Oct 6, 2020 · Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition ☆127 · Jun 10, 2019 · Updated 6 years ago
- Implementations for master thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB. ☆16 · Apr 4, 2019 · Updated 6 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition ☆47 · Jun 24, 2020 · Updated 5 years ago
- End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training) ☆314 · Jan 23, 2018 · Updated 8 years ago
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21 ☆17 · May 14, 2022 · Updated 3 years ago
- An emulator of General Vision's CM1K neuromorphic chip, including modeling experiments using several popular public datasets ☆11 · Jun 2, 2017 · Updated 8 years ago
- Collection of machine learning demos for Automatic Speech Recognition ☆55 · Sep 24, 2021 · Updated 4 years ago
- Backpropagable PyTorch implementation of https://craffel.github.io/mir_eval/. ☆35 · Jul 8, 2024 · Updated last year