robmsmt / ASR-Audio-Data-LinksLinks
A list of publically available audio data that anyone can download for ASR or other speech activities
☆215Updated 3 years ago
Alternatives and similar repositories for ASR-Audio-Data-Links
Users that are interested in ASR-Audio-Data-Links are comparing it to the libraries listed below
Sorting:
- ASR with PyTorch☆139Updated 6 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆174Updated 5 years ago
- INTERSPEECH 2019 Tutorial Materials☆195Updated 4 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆243Updated 5 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆218Updated 5 months ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0☆248Updated 2 weeks ago
- A pure python module for reading and writing kaldi ark files☆259Updated 4 months ago
- CMU Wilderness Multilingual Speech Dataset☆283Updated 6 years ago
- ☆259Updated 2 years ago
- experiments with RETURNN☆159Updated 2 months ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆155Updated 5 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- ☆275Updated 4 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- This program calculates the word error rate of hypothesis in ASR and print the aligned result.☆155Updated 5 years ago
- Problem Agnostic Speech Encoder☆442Updated 2 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 6 years ago
- ESPnet Model Zoo☆255Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆338Updated last year
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆377Updated 2 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆186Updated 5 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆227Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆176Updated 7 months ago