assafmu / wav2letter_pytorch
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for wav2letter_pytorch
- ☆19Updated 5 years ago
- ☆12Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- [DEPRECATED] Audio Module for fastai v2☆65Updated last year
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- A collection of utilities for handling IPA phones.☆24Updated last year
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated last year
- ☆11Updated 3 years ago
- Attacking Speaker Recognition with Deep Generative Models☆34Updated last year
- ☆56Updated last year
- End-to-end diarization loss☆22Updated 3 years ago
- The repository for Speech Recognition Israel meetup group. It is used to material collection and sharing.☆13Updated 4 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated last year
- ☆16Updated 5 years ago
- ☆17Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- ☆12Updated 8 months ago
- Fast and differentiable hidden Markov model in C++☆15Updated last year
- ☆16Updated 5 years ago
- An audio classification system for learning with out-of-distribution data☆32Updated last year
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆36Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- Bachelor's thesis carried at Universitat Politecnica de Catalunya in partial fullfilment of the requirements for the degree in Telecommun…☆15Updated 3 months ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 3 years ago
- DenseNets for the detection of singing birds in audio files☆17Updated 6 years ago