monatis / asr-annotation-bot
Simple Telegram bot to annotate and varify automatic speech recognition datasets
☆12Updated 3 years ago
Related projects: ⓘ
- scipts for working with open.bible data☆23Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆16Updated 6 months ago
- asr2k☆48Updated 3 months ago
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆19Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 4 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 7 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- Official Repository of the Deep Diacritization Paper☆16Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆24Updated 5 years ago
- Text Classification Dataset for Turkish Language☆10Updated 2 years ago
- Some tutorials used for ASR class☆32Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- Word Error Rate Estimation☆10Updated 4 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆42Updated last year
- ☆10Updated 11 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆41Updated 3 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- ☆38Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆15Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 2 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆48Updated this week
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆10Updated 4 months ago
- ☆16Updated 3 years ago