đ§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
â223Jun 15, 2020Updated 5 years ago
Alternatives and similar repositories for Automatic-Speech-Recognition
Users that are interested in Automatic-Speech-Recognition are comparing it to the libraries listed below
Sorting:
- Keras(Tensorflow) implementations of Automatic Speech Recognitionâ24Jan 13, 2022Updated 4 years ago
- End-to-End Speech Recognition Using Tensorflowâ40Mar 24, 2023Updated 2 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.â123Apr 15, 2020Updated 5 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesâ231Aug 6, 2021Updated 4 years ago
- RawNet: Fast End-to-End Neural Vocoderâ42May 29, 2019Updated 6 years ago
- End-to-End Automatic Speech Recognition on PyTorchâ304Jun 2, 2022Updated 3 years ago
- End-to-end ASR/LM implementation with PyTorchâ594Aug 30, 2021Updated 4 years ago
- Losses and decoders for end-to-end ASR and OCRâ34Oct 30, 2020Updated 5 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentationâ243Mar 17, 2018Updated 7 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.â65Jan 2, 2020Updated 6 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0â249Jul 15, 2025Updated 7 months ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.â130Mar 31, 2021Updated 4 years ago
- DeepSpeech based forced alignment toolâ239Dec 12, 2020Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.â808Apr 6, 2023Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.â16Jun 17, 2022Updated 3 years ago
- transformer for ASR-systerm (via tensorflow2.0)â114May 7, 2019Updated 6 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learningâ231Mar 23, 2021Updated 4 years ago
- Segment speech sequences based on speaker transitions, using ML and DSP.â17Jul 30, 2018Updated 7 years ago
- ĺçźăăăšć¨ĺŽă§ć¨ĺŽăăç¸ĺŻžčˇé˘ăăˇăłăăŤăŞăăŁăŞăăŹăźăˇă§ăłă§çľśĺŻžčˇé˘ă¸ĺ¤ćăăăăă°ăŠăâ17Dec 31, 2021Updated 4 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.â522Feb 17, 2022Updated 4 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)â100Apr 20, 2020Updated 5 years ago
- Keras implementations of Tacotron-2â27Jan 22, 2021Updated 5 years ago
- Converts spoken words into text form.â76Sep 17, 2025Updated 5 months ago
- ESPnet-TTS Audio Sample HPâ21Oct 25, 2019Updated 6 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognitionâ127Jun 10, 2019Updated 6 years ago
- Yet another speech toolkit based on Kaldi and PyTorchâ173Jul 1, 2020Updated 5 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variantâ10Aug 12, 2019Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.â13Feb 13, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeâ15Mar 26, 2022Updated 3 years ago
- XDoG(Extended Difference of Gaussians)ă˘ăŤă´ăŞăşă ăç¨ăăçˇçťć˝ĺşăŽăľăłăăŤă§ăăâ15Jan 28, 2021Updated 5 years ago
- Siamese network for unsupervised speech representation learningâ11Oct 12, 2018Updated 7 years ago
- đŚ A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognitionâ500Jun 11, 2021Updated 4 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.â33Apr 24, 2020Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brainâ656Apr 5, 2022Updated 3 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )â292Aug 5, 2021Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196â320Nov 11, 2020Updated 5 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )â537Feb 9, 2022Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)â45Jun 29, 2021Updated 4 years ago
- Instructions on downloading and using the LibriAdapt datasetâ46Aug 13, 2021Updated 4 years ago