🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
Alternatives and similar repositories for data-checker
Users that are interested in data-checker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 3 years ago
- Linguistic processing for Common Voice☆59Jan 18, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Mar 7, 2025Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24May 16, 2021Updated 4 years ago
- open source knowledge for Syllabics font design and development☆10Nov 13, 2024Updated last year
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Jun 25, 2024Updated last year
- Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from t…☆11Sep 8, 2023Updated 2 years ago
- Awesome stuff made by the Mycroft community☆13Sep 16, 2021Updated 4 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆20Apr 5, 2021Updated 5 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- Фонограми та синтагми: інструменти обробки☆21Jun 21, 2025Updated 10 months ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Using AI based approach to detect illegal parking of vehicles (Cars) from an image. The model will receive an image of parked car through…☆11Jun 2, 2020Updated 5 years ago
- The Gaming Zone is a web application that provides you with a collection of classic retro games, including puzzle games, trivia games, bo…☆10Feb 11, 2020Updated 6 years ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆262Nov 15, 2025Updated 5 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Koa.js framework setup to run within Next.js API routes.☆12Updated this week
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Mar 28, 2026Updated last month
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆48Apr 18, 2025Updated last year
- A pipeline to isolate and transcribe one language in mixed-language speech☆20Oct 25, 2022Updated 3 years ago
- Split words with Unicode's default word boundary specification☆13Sep 12, 2024Updated last year
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Nov 25, 2025Updated 5 months ago
- Docker images for Coqui AI☆61Jul 5, 2021Updated 4 years ago
- Npm package for official bc gov web font☆22Jul 14, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A guide to building language technology in new languages.☆60Feb 1, 2022Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Suite of web packages for creating interactive ReadAlongs☆16Apr 16, 2026Updated 2 weeks ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Various utilities regarding Levenshtein transducers. (CoffeeScript / JavaScript / Node.js)☆13Jun 20, 2016Updated 9 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- ☆55Jan 13, 2023Updated 3 years ago