coqui-ai / data-checker
💫 check your data, before you wreck your model
★16 Updated 2 years ago
Alternatives and similar repositories for data-checker:
Users that are interested in data-checker are comparing it to the libraries listed below
- Scripts for working with open.bible data ★24 Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ★27 Updated last year
- Scripts to create speech corpora from open.bible ★13 Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages. ★13 Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ★25 Updated last year
- Coqui Inference Engine ★38 Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ★20 Updated 11 months ago
- ★17 Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ★47 Updated 8 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ★19 Updated 4 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ★22 Updated 7 months ago
- Prosodic Speech Segmentation with Transformers ★25 Updated last year
- Convert English text from written expressions into spoken forms ★24 Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ★17 Updated last year
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… ★32 Updated 4 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ★12 Updated 2 years ago
- ★9 Updated this week
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ★24 Updated 3 years ago
- TTS Client for Coqui TTS server ★13 Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ★16 Updated 2 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model. ★20 Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ★29 Updated last year
- Collection of scripts from mHuBERT-147. ★24 Updated 3 months ago
- Finetuning VITS Efficiently ★32 Updated last year
- phone inventory library ★16 Updated last year
- A python package for whisper normalizer ★49 Updated this week
- Linguistic processing for Common Voice ★53 Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ★44 Updated 3 years ago
- ★11 Updated 3 years ago