assafmu / wav2letter_pytorchLinks
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Updated 2 years ago
Alternatives and similar repositories for wav2letter_pytorch
Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below
Sorting:
- ☆32Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 4 years ago
- ☆12Updated 4 years ago
- ☆16Updated 7 years ago
- ☆11Updated 4 years ago
- ☆32Updated 3 years ago
- Fast and differentiable hidden Markov model in C++☆18Updated 2 years ago
- Applying reinforcement learning to perform source separation.☆23Updated 5 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated last year
- A collection of utilities for handling IPA phones.☆26Updated 2 years ago
- Interspeech 2019 tutorial materials☆49Updated 6 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Updated 3 years ago
- Convert words to numbers☆21Updated 3 years ago
- ☆29Updated 5 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Updated 3 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 5 years ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Updated 2 years ago
- An audio classification system for learning with out-of-distribution data☆33Updated 2 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆39Updated 2 years ago
- ☆17Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆22Updated 2 years ago
- ☆26Updated 4 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Updated 4 years ago
- Example workflow for our data-centric speech benchmark☆17Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 5 years ago
- Addressing the confounds of accompaniments in singer identification☆18Updated 5 years ago