Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
☆18Nov 13, 2021Updated 4 years ago
Alternatives and similar repositories for WOLOF-ASR-Wav2Vec2
Users that are interested in WOLOF-ASR-Wav2Vec2 are comparing it to the libraries listed below
Sorting:
- ☆10Jun 23, 2023Updated 2 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"☆13Jul 26, 2021Updated 4 years ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆12Sep 30, 2021Updated 4 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 4 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Mar 19, 2024Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Mar 11, 2021Updated 4 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 5 years ago
- asr2k☆52Jun 2, 2024Updated last year
- Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0☆51Apr 23, 2022Updated 3 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 4 years ago
- Curate online wolof text resources that can be used to build models☆27Updated this week
- Dennis Klatt's speech synthesis system, updated with a Python interface.☆30Jun 23, 2025Updated 8 months ago
- ☆28Oct 7, 2025Updated 5 months ago
- Dynamic time warping (DTW) functions for specifically speech alignment.☆30May 6, 2024Updated last year
- Wolof is a library that you can use to do specific tasks in NLP with the Wolof language e.g. text classification in Wolof , NMT , ASR☆31Nov 28, 2023Updated 2 years ago
- derivative of the klatt 3.04 synthesizer☆40Dec 27, 2015Updated 10 years ago
- The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.☆33Nov 29, 2018Updated 7 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆10Mar 19, 2019Updated 6 years ago
- This project is from the Airbnb Recruitment Challenge on Kaggle. The challenge is to solve a multi-class classification problem of predic…☆11Feb 22, 2022Updated 4 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 10 months ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Dec 20, 2022Updated 3 years ago
- Wav2vec resources and models for Brazilian Portuguese☆37Jul 15, 2022Updated 3 years ago
- ☆32Dec 4, 2022Updated 3 years ago
- A JAX library for building lattice-based speech transducer models☆47Updated this week
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- MG top-down beam parsing☆13Jul 2, 2018Updated 7 years ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆10Dec 27, 2021Updated 4 years ago
- Medical data processing and ML workshops☆10May 16, 2023Updated 2 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Deep learning-based audio spoofing attack detection experiments for speaker verification.☆14Apr 20, 2023Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Oct 6, 2023Updated 2 years ago
- ☆40Jan 14, 2022Updated 4 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46May 12, 2023Updated 2 years ago
- ☆41Mar 21, 2022Updated 3 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆41Jul 25, 2024Updated last year