projecte-aina / lm-catalan
Official source for Catalan Language Models and resources made within Aina project.
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for lm-catalan
- Deepspeech ASR Model for the Catalan Language☆17Updated 3 years ago
- phone inventory library☆15Updated last year
- ☆16Updated 3 years ago
- ☆10Updated 2 years ago
- ☆12Updated 8 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- ☆11Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆14Updated last month
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- ☆17Updated last year
- steps to perform text-based speaker diarization with kaldi toolkit☆11Updated 6 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆15Updated 2 weeks ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 8 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- ASR text preprocessing utility☆20Updated 3 months ago
- Evaluation of STT models for german language☆15Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆13Updated 6 months ago
- A collection of utilities for handling IPA phones.☆24Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆34Updated last year
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Simple Kaldi recipe for forced alignment☆10Updated last year
- radiomixer☆14Updated 2 years ago
- PolEval 2021 Task 1☆15Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- ☆10Updated 3 years ago