projecte-aina / lm-catalan
Official source for Catalan Language Models and resources made within Aina project.
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for lm-catalan
- Deepspeech ASR Model for the Catalan Language☆17Updated 3 years ago
- ☆16Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- ☆12Updated 8 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆16Updated 3 weeks ago
- phone inventory library☆15Updated last year
- Simple Kaldi recipe for forced alignment☆10Updated last year
- Workflow for forced alignment between languages☆17Updated 9 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆11Updated 6 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Updated last month
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- A handy dataset of noises for ASR☆19Updated 5 years ago
- PolEval 2021 Task 1☆15Updated 2 years ago
- ☆10Updated last year
- Repository for multilingual speech data resources for native languages of Zambia.☆14Updated last month
- ☆17Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- ☆11Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- ☆10Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆22Updated this week
- A collection of utilities for handling IPA phones.☆25Updated last year
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆19Updated last week
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆13Updated 6 months ago
- Goodness of Pronunciation algorithm using PyKaldi☆14Updated 2 years ago
- Evaluation of STT models for german language☆15Updated 2 years ago
- ☆25Updated 2 years ago