rolczynski / Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
☆223Jun 15, 2020Updated 5 years ago
Alternatives and similar repositories for Automatic-Speech-Recognition
Users who are interested in Automatic-Speech-Recognition are comparing it to the libraries listed below.
- Keras (TensorFlow) implementations of Automatic Speech Recognition☆24Jan 13, 2022Updated 4 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 5 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆232Aug 6, 2021Updated 4 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentation☆243Mar 17, 2018Updated 7 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Jan 2, 2020Updated 6 years ago
- End-to-end speech recognition using RNN Transducers in TensorFlow 2.0☆249Jul 15, 2025Updated 7 months ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆130Mar 31, 2021Updated 4 years ago
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆809Apr 6, 2023Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fast.☆16Jun 17, 2022Updated 3 years ago
- Transformer for an ASR system (via TensorFlow 2.0)☆114May 7, 2019Updated 6 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆230Mar 23, 2021Updated 4 years ago
- Segment speech sequences based on speaker transitions, using ML and DSP.☆17Jul 30, 2018Updated 7 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆521Feb 17, 2022Updated 3 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆100Apr 20, 2020Updated 5 years ago
- Keras implementations of Tacotron-2☆27Jan 22, 2021Updated 5 years ago
- Converts spoken words into text form.☆76Sep 17, 2025Updated 4 months ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Jul 1, 2020Updated 5 years ago
- Siamese network for unsupervised speech representation learning☆11Oct 12, 2018Updated 7 years ago
- PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- 🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆499Jun 11, 2021Updated 4 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 5 years ago
- An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain☆656Apr 5, 2022Updated 3 years ago
- Working online speech recognition based on RNN Transducer. (Trained model available in releases)☆293Aug 5, 2021Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- This is a list of features, scripts, blogs and resources for making better use of Kaldi (http://kaldi-asr.org/)☆537Feb 9, 2022Updated 4 years ago
- Instructions on downloading and using the LibriAdapt dataset☆46Aug 13, 2021Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Jun 29, 2021Updated 4 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 6 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supported languages that can use characters or subw…☆1,005Jun 11, 2025Updated 8 months ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Dec 4, 2018Updated 7 years ago