freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
β943Updated 7 months ago
Alternatives and similar repositories for espresso:
Users that are interested in espresso are comparing it to the libraries listed below
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,318Updated 9 months ago
- A Python wrapper for Kaldiβ1,011Updated 2 months ago
- Open tools and data for cloudless automatic speech recognitionβ447Updated 4 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ468Updated 5 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,β¦β2,384Updated 3 years ago
- Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLPβ1,559Updated 3 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β963Updated this week
- End-to-end ASR/LM implementation with PyTorchβ596Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.β376Updated last year
- PyTorch implementations of neural network models for keyword spottingβ515Updated last year
- Tools for handling speech data in machine learning projects.β1,001Updated this week
- g2p: English Grapheme To Phoneme Conversionβ847Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β859Updated last year
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.β1,864Updated 2 years ago
- Problem Agnostic Speech Encoderβ440Updated last year
- FSA/FST algorithms, differentiable, with PyTorch compatibility.β1,182Updated 3 weeks ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β365Updated 3 months ago
- Efficient neural speech synthesisβ1,162Updated 6 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 4 years ago
- CMU Wilderness Multilingual Speech Datasetβ278Updated 5 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ205Updated 3 years ago
- Speech Recognition using DeepSpeech2.β2,114Updated 2 years ago
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)β705Updated last month
- A neural network for end-to-end speech denoisingβ690Updated last year
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )β292Updated 3 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learningβ225Updated 4 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretationβ528Updated 2 years ago
- End-2-end speech synthesis with recurrent neural networksβ226Updated last year
- A PyTorch Implementation of End-to-End Models for Speech-to-Textβ758Updated last year