🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
Alternatives and similar repositories for data-checker
Users that are interested in data-checker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆35Mar 31, 2023Updated 3 years ago
- Linguistic processing for Common Voice☆59Jan 18, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆11Mar 7, 2025Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆28Sep 23, 2022Updated 3 years ago
- DEPRECATED - A crash course for training speech recognition models using DeepSpeech.☆24May 16, 2021Updated 5 years ago
- Helping travelers stranded by Trump☆10Oct 5, 2022Updated 3 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆38Jan 16, 2021Updated 5 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Jun 25, 2024Updated last year
- GUI applikation for the Klatt formant synthesizer package☆13Feb 16, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from t…☆11Sep 8, 2023Updated 2 years ago
- Awesome stuff made by the Mycroft community☆13Sep 16, 2021Updated 4 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- ☆20Apr 5, 2021Updated 5 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- Фонограми та синтагми: інструменти обробки☆21Jun 21, 2025Updated 11 months ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- An open source platform for browser based speech and audio subjective quality tests.☆38May 19, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Using AI based approach to detect illegal parking of vehicles (Cars) from an image. The model will receive an image of parked car through…☆11Jun 2, 2020Updated 5 years ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆263Nov 15, 2025Updated 6 months ago
- Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.☆14Apr 3, 2021Updated 5 years ago
- Koa.js framework setup to run within Next.js API routes.☆12May 14, 2026Updated last week
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Mar 28, 2026Updated last month
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆47Apr 18, 2025Updated last year
- A pipeline to isolate and transcribe one language in mixed-language speech☆20Oct 25, 2022Updated 3 years ago
- Split words with Unicode's default word boundary specification☆13Sep 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 5 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Nov 25, 2025Updated 6 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data☆19Dec 4, 2021Updated 4 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Docker images for Coqui AI☆62Jul 5, 2021Updated 4 years ago
- Npm package for official bc gov web font☆23Jul 14, 2025Updated 10 months ago