This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
☆40Feb 10, 2018Updated 8 years ago
Alternatives and similar repositories for pytorch_MLP_for_ASR
Users that are interested in pytorch_MLP_for_ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆97May 30, 2020Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆24Sep 25, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python implementation of pre-processing for End-to-End speech recognition☆70Feb 19, 2018Updated 8 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research☆15Aug 28, 2015Updated 10 years ago
- PyTorch bindings for Warp-CTC☆42Dec 6, 2019Updated 6 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Oct 28, 2016Updated 9 years ago
- ☆13Sep 12, 2017Updated 8 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- implement end-to-end asr algorithm with tensorflow☆40Aug 23, 2018Updated 7 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 7 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Dec 30, 2019Updated 6 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆42Jun 25, 2018Updated 7 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks☆18Nov 5, 2019Updated 6 years ago
- 语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总☆23Nov 8, 2019Updated 6 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,398Mar 14, 2022Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Keyword Spotting suitable for embedded devices.☆28Jun 22, 2020Updated 5 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- This repository can be used to perform Speech to Text Conversion in multiple Languages, e.g., It can convert whatever you are speaking in…☆11Oct 6, 2020Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆17May 14, 2022Updated 4 years ago
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.☆35Jul 8, 2024Updated last year
- the MEX wrapper for PESQ (Perceptual Evaluation of Speech Quality)☆15May 10, 2019Updated 7 years ago
- Pytorch implementations of GMM - HMM☆10Dec 28, 2020Updated 5 years ago
- audio cfeatures extraction tool from wav to h5features format☆19May 24, 2019Updated 6 years ago
- Deep neural network based speech enhancement toolkit☆220Jun 14, 2019Updated 6 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆99Apr 5, 2022Updated 4 years ago