assafmu / wav2letter_pytorchLinks
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Updated 2 years ago
Alternatives and similar repositories for wav2letter_pytorch
Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below
Sorting:
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 4 years ago
- ☆12Updated 4 years ago
- ☆32Updated 3 years ago
- Convert words to numbers☆21Updated 3 years ago
- ☆11Updated 3 years ago
- Baseline convolutional ASR system in PyTorch☆21Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- ☆17Updated 2 years ago
- [DEPRECATED] Audio Module for fastai v2☆65Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 4 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated last year
- A neural language modeling toolkit built on PyTorch☆18Updated 2 years ago
- ☆32Updated 3 years ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Updated 2 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 5 years ago
- ☆16Updated 5 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Updated 3 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- ☆17Updated 4 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Interspeech 2019 tutorial materials☆49Updated 5 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆21Updated 5 years ago
- ☆21Updated 5 years ago
- An audio classification system for learning with out-of-distribution data☆33Updated 2 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year
- Voice conversion training with 109 speakers with limited training samples☆35Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆42Updated 3 years ago