coqui-ai / data-checker
check your data, before you wreck your model
★16 · Updated 2 years ago
Alternatives and similar repositories for data-checker:
Users who are interested in data-checker are comparing it to the libraries listed below
- TTS Client for Coqui TTS server ★13 · Updated 2 years ago
- Scripts for working with open.bible data ★24 · Updated 3 years ago
- Coqui Inference Engine ★38 · Updated 3 years ago
- ★79 · Updated 11 months ago
- Prosodic Speech Segmentation with Transformers ★25 · Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ★14 · Updated 2 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion. ★31 · Updated 2 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model. ★23 · Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages. ★13 · Updated last year
- Final training script from the HuggingFace Whisper fine-tuning event, to get the best results from a fine-tuned model. ★12 · Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ★27 · Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ★102 · Updated 2 years ago
- Linguistic processing for Common Voice ★55 · Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ★23 · Updated 8 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ★26 · Updated last week
- Scripts to create speech corpora from open.bible ★13 · Updated 3 years ago
- ★17 · Updated 3 years ago
- Swarah: Indian-English speech dataset collected across the country ★29 · Updated last year
- Heteronym to Phoneme Parser ★18 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ★28 · Updated last year
- 🐸TTS recipes for different datasets ★86 · Updated 2 years ago
- asr2k ★50 · Updated 10 months ago
- Tunable pipelines ★33 · Updated 2 months ago
- Simple text to phonemes converter for multiple languages ★20 · Updated 2 years ago
- ★56 · Updated 2 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias ★21 · Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ★25 · Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio ★35 · Updated 2 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ★25 · Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from the Ferraz et al. 2024 IEEE ICASSP paper. ★21 · Updated last year