assafmu / wav2letter_pytorchLinks
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Updated 2 years ago
Alternatives and similar repositories for wav2letter_pytorch
Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below
Sorting:
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated last year
- ☆32Updated 3 years ago
- ☆12Updated 4 years ago
- ☆32Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 4 years ago
- ☆11Updated 3 years ago
- A collection of utilities for handling IPA phones.☆26Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Updated 4 years ago
- ☆56Updated 2 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Interspeech 2019 tutorial materials☆49Updated 6 years ago
- ☆17Updated 2 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Updated 4 months ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆42Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 4 years ago
- Pytorch Implementation of WaveNODE☆64Updated 5 years ago
- Fast and differentiable hidden Markov model in C++☆17Updated 2 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated 2 years ago
- ☆26Updated 4 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 6 years ago
- A Pytorch Implementation of MelGAN☆65Updated 5 years ago
- ☆21Updated 6 years ago
- PyTorch implementation of NVIDIA WaveGlow with constant memory cost.☆36Updated 2 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- Convert words to numbers☆21Updated 3 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year