assafmu / wav2letter_pytorch
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Updated 2 years ago
Alternatives and similar repositories for wav2letter_pytorch:
Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- ☆31Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 3 years ago
- ☆12Updated 3 years ago
- ☆11Updated 3 years ago
- ☆16Updated 5 years ago
- ☆9Updated 5 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- The repository for Speech Recognition Israel meetup group. It is used to material collection and sharing.☆13Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- ☆32Updated 3 years ago
- ☆17Updated 2 years ago
- WaveNet implementation using tf.estimator☆21Updated last year
- Attacking Speaker Recognition with Deep Generative Models☆34Updated 2 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 weeks ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 4 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Embedded segmental K-means (ES-KMeans) in Python.☆14Updated last year
- ☆56Updated 2 years ago
- ☆26Updated 4 years ago
- Convert words to numbers☆20Updated 3 years ago
- An audio classification system for learning with out-of-distribution data☆33Updated 2 years ago
- ☆14Updated 6 years ago
- ☆8Updated 7 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated 9 months ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago