assafmu / wav2letter_pytorch
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Updated last year
Alternatives and similar repositories for wav2letter_pytorch:
Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below
- [DEPRECATED] Audio Module for fastai v2☆65Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- ☆16Updated 5 years ago
- Baseline convolutional ASR system in PyTorch☆21Updated last year
- End-to-end diarization loss☆22Updated 3 years ago
- ☆11Updated 3 years ago
- The repository for Speech Recognition Israel meetup group. It is used to material collection and sharing.☆13Updated 4 years ago
- ☆20Updated 5 years ago
- ☆12Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- ☆17Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- ☆11Updated 6 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆36Updated last year
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- ☆56Updated 2 years ago
- Deep Speech Distances PyTorch☆27Updated 2 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated last year
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Updated 5 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- ☆31Updated 2 years ago
- ☆32Updated 3 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year