harvard-edge / dataperf-speech-example
Example workflow for our data-centric speech benchmark
☆17Updated 2 years ago
Alternatives and similar repositories for dataperf-speech-example
Users who are interested in dataperf-speech-example compare it to the repositories listed below.
- GroupMap: beyond mean and variance matching for deep learning☆10Updated 2 years ago
- Enable RNNLM lattice rescoring with PyTorch [kaldi]☆12Updated 5 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- A library of speech gadgets.☆13Updated 2 years ago
- Fast and differentiable hidden Markov model in C++☆17Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Updated 3 years ago
- ☆11Updated 10 years ago
- ☆15Updated 2 years ago
- ☆32Updated 2 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Sisyphus recipes for ASR☆16Updated last week
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated 2 weeks ago
- ☆15Updated 6 years ago
- Hifi-like Vocoder implemented in PyTorch☆13Updated 2 years ago
- Module 1 - Autodifferentiation☆22Updated 10 months ago
- Tutorial covering Open Source tools for Source Separation.☆15Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
- Train a fiwGAN or ciwGAN model using your own training data☆13Updated 2 years ago
- ☆11Updated 3 years ago
- ☆10Updated this week
- ☆18Updated last year
- ☆32Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- ☆23Updated 2 years ago
- ☆10Updated 2 years ago