🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
Alternatives and similar repositories for data-checker
Users that are interested in data-checker are comparing it to the libraries listed below.
- Scripts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 2 years ago
- Linguistic processing for Common Voice☆58Jan 18, 2024Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Sep 23, 2022Updated 3 years ago
- Helping travelers stranded by Trump☆10Oct 5, 2022Updated 3 years ago
- open source knowledge for Syllabics font design and development☆10Nov 13, 2024Updated last year
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆36Jan 16, 2021Updated 5 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Jun 25, 2024Updated last year
- GUI application for the Klatt formant synthesizer package☆11Feb 16, 2026Updated last month
- Mother Tongues Dictionaries dictionary creation tool☆15May 21, 2024Updated last year
- Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from t…☆11Sep 8, 2023Updated 2 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Awesome stuff made by the Mycroft community☆13Sep 16, 2021Updated 4 years ago
- ☆20Apr 5, 2021Updated 4 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- Phonograms and syntagms: processing tools☆21Jun 21, 2025Updated 9 months ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Using AI based approach to detect illegal parking of vehicles (Cars) from an image. The model will receive an image of parked car through…☆11Jun 2, 2020Updated 5 years ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆262Nov 15, 2025Updated 4 months ago
- Simple word-to-frequency mappings for the German language, based on text corpora and using the CISTEM stemmer.☆14Apr 3, 2021Updated 4 years ago
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Feb 11, 2026Updated last month
- A pipeline to isolate and transcribe one language in mixed-language speech☆20Oct 25, 2022Updated 3 years ago
- Split words with Unicode's default word boundary specification☆13Sep 12, 2024Updated last year
- Interface for using TTS and vocoder models in the form of a text editor☆19Nov 25, 2025Updated 4 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Docker images for Coqui AI☆61Jul 5, 2021Updated 4 years ago
- Home surveillance system with facial recognition☆17Jun 10, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Suite of web packages for creating interactive ReadAlongs☆16Mar 16, 2026Updated last week
- A Python wrapper for libhackrf☆12Jul 10, 2023Updated 2 years ago
- Various utilities regarding Levenshtein transducers. (CoffeeScript / JavaScript / Node.js)☆13Jun 20, 2016Updated 9 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago