gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for the GramVaani Hindi ASR challenge.
☆15Mar 26, 2022Updated 3 years ago
Alternatives and similar repositories for gramvaani_hindi_asr
Users who are interested in gramvaani_hindi_asr are comparing it to the repositories listed below
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Jan 18, 2023Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- ☆15May 8, 2021Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- Vim Speech Recognition Experiments☆20May 30, 2025Updated 8 months ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Korean ASR Corpus generated from TEDx talks☆27Jan 11, 2019Updated 7 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- ☆10Mar 20, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- PyTorch CTC implementation for ASR. Uses eesen's FST decoder framework☆10Feb 27, 2020Updated 5 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- VoxAngeles Corpus☆13Aug 23, 2025Updated 5 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- ☆22Apr 8, 2022Updated 3 years ago
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Feb 28, 2018Updated 7 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 4 years ago
- PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆18Mar 17, 2025Updated 10 months ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago