mzboito / IWSLT2022_Tamasheq_dataView external linksLinks
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWSLT2022.
☆18Nov 30, 2022Updated 3 years ago
Alternatives and similar repositories for IWSLT2022_Tamasheq_data
Users that are interested in IWSLT2022_Tamasheq_data are comparing it to the libraries listed below
Sorting:
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- ☆22Apr 8, 2022Updated 3 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- VoxAngeles Corpus☆13Aug 23, 2025Updated 5 months ago
- ☆10Mar 20, 2021Updated 4 years ago
- ☆13Apr 14, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆25Mar 12, 2022Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- ☆10Sep 19, 2022Updated 3 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- ☆11May 7, 2022Updated 3 years ago
- ☆11Nov 28, 2025Updated 2 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago