🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
Alternatives and similar repositories for data-checker
Users that are interested in data-checker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 3 years ago
- Linguistic processing for Common Voice☆58Jan 18, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Mar 7, 2025Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Sep 23, 2022Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24May 16, 2021Updated 4 years ago
- open source knowledge for Syllabics font design and development☆10Nov 13, 2024Updated last year
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Jun 25, 2024Updated last year
- GUI applikation for the Klatt formant synthesizer package☆12Feb 16, 2026Updated 2 months ago
- Mother Tongues Dictionaries dictionary creation tool☆15May 21, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from t…☆11Sep 8, 2023Updated 2 years ago
- Awesome stuff made by the Mycroft community☆13Sep 16, 2021Updated 4 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- ☆20Apr 5, 2021Updated 5 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- Фонограми та синтагми: інструменти обробки☆21Jun 21, 2025Updated 9 months ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- An open source platform for browser based speech and audio subjective quality tests.☆37Nov 16, 2025Updated 5 months ago
- Using AI based approach to detect illegal parking of vehicles (Cars) from an image. The model will receive an image of parked car through…☆11Jun 2, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The Gaming Zone is a web application that provides you with a collection of classic retro games, including puzzle games, trivia games, bo…☆10Feb 11, 2020Updated 6 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆263Nov 15, 2025Updated 5 months ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.☆14Apr 3, 2021Updated 5 years ago
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆48Apr 18, 2025Updated 11 months ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆20Oct 25, 2022Updated 3 years ago
- Split words with Unicode's default word boundary specification☆13Sep 12, 2024Updated last year
- Toy example on how to build a unit selection TTS in Spanish☆11May 10, 2019Updated 6 years ago
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Interface for using TTS and vocoder models in the form of a text editor☆19Nov 25, 2025Updated 4 months ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Docker images for Coqui AI☆61Jul 5, 2021Updated 4 years ago
- Npm package for official bc gov web font☆21Jul 14, 2025Updated 9 months ago
- A guide to building language technology in new languages.☆60Feb 1, 2022Updated 4 years ago
- Home surveillance system with facial recognition☆17Jun 10, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago