coqui-ai / data-checker
check your data, before you wreck your model
☆16 Updated 2 years ago
Alternatives and similar repositories for data-checker:
Users that are interested in data-checker are comparing it to the libraries listed below
- scripts for working with open.bible data ☆24 Updated 3 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- Scripts to create speech corpora from open.bible ☆13 Updated 3 years ago
- A JAX library for building lattice-based speech transducer models ☆45 Updated 3 months ago
- ☆17 Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages. ☆13 Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 Updated last year
- Prosodic Speech Segmentation with Transformers ☆25 Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆20 Updated last year
- Coqui Inference Engine ☆38 Updated 3 years ago
- ☆8 Updated last year
- phone inventory library ☆16 Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆24 Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… ☆32 Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆48 Updated 9 months ago
- A handy dataset of noises for ASR ☆20 Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆31 Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆14 Updated 2 years ago
- Implementation of Google's USM speech model in Pytorch ☆30 Updated 2 months ago
- Simple PyTorch Denoisers for Waveform Audio ☆35 Updated last month
- Zero-shot Audio Classification using Whisper ☆80 Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆45 Updated 3 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 Updated 2 years ago