goodmike31 / pl-asr-speech-data-survey
Survey of available speech datasets for Polish ASR development
☆12Updated 3 weeks ago
Alternatives and similar repositories for pl-asr-speech-data-survey:
Users that are interested in pl-asr-speech-data-survey are comparing it to the libraries listed below
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- A JAX library for building lattice-based speech transducer models☆41Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 11 months ago
- ☆56Updated 2 years ago
- phone inventory library☆16Updated last year
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆24Updated 3 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Finite-state script normalization and processing utilities☆38Updated last week
- Read-only unofficial mirror of the OpenGrm NGram Library☆8Updated 5 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- ☆20Updated 6 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- ☆9Updated 3 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆34Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support☆25Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆16Updated 2 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Train a fiwGAN or ciwGAN model using your own training data☆13Updated 2 years ago
- A collection of utilities for handling IPA phones.☆26Updated last year
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- ☆24Updated 4 years ago