coqui-ai / data-checker
π« check your data, before you wreck your model
β16Updated 2 years ago
Related projects β
Alternatives and complementary repositories for data-checker
- scipts for working with open.bible dataβ23Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.β13Updated last year
- Scripts to create speech corpora from open.bibleβ12Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.β13Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated 9 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ24Updated last year
- A JAX library for building lattice-based speech transducer modelsβ40Updated 3 weeks ago
- TTS Client for Coqui TTS serverβ13Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a lβ¦β22Updated 3 months ago
- Coqui Inference Engineβ38Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β83Updated last month
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.β18Updated 8 months ago
- Rescoring methods for end-to-end Automatic Speech Recognitionβ27Updated 4 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.β20Updated last year
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformerβ44Updated 4 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated last year
- A TTS model that makes a speaker speak new languagesβ75Updated 5 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.β12Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β28Updated last year
- β16Updated 3 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic biasβ21Updated 4 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your languageβ21Updated this week
- Linguistic processing for Common Voiceβ52Updated 10 months ago
- Implementation of Google's USM speech model in Pytorchβ25Updated last week
- β56Updated last year
- β11Updated 3 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.β24Updated 3 weeks ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β100Updated last year
- β74Updated 3 years ago