Tool to collect and review sentences for Common Voice
☆83 · May 10, 2023 · Updated 3 years ago
Alternatives and similar repositories for sentence-collector
Users that are interested in sentence-collector are comparing it to the libraries listed below.
- Scraping Wikipedia for fair use sentences ☆54 · Jan 25, 2024 · Updated 2 years ago
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language ☆11 · Apr 13, 2023 · Updated 3 years ago
- Command line tool to create corpora for Common Voice ☆78 · Mar 25, 2026 · Updated 2 months ago
- Metadata and versioning details for the Common Voice dataset ☆171 · Apr 10, 2026 · Updated last month
- Common Voice is part of Mozilla's initiative to help teach machines how real people speak. ☆3,468 · May 22, 2026 · Updated last week
- Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP ☆13 · Jan 6, 2022 · Updated 4 years ago
- Tooling for producing French dataset for Common Voice ☆101 · Jan 20, 2025 · Updated last year
- Repository of "CV Project" app. It's an unofficial app for Mozilla Common Voice, which permits you to contribute to this project via your… ☆114 · May 20, 2025 · Updated last year
- Mozilla Voice Community Playbook ☆48 · May 21, 2024 · Updated 2 years ago
- ☆14 · Jun 22, 2020 · Updated 5 years ago
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning ☆17 · Jan 4, 2023 · Updated 3 years ago
- Thai PDPA Website (Unofficial) ☆11 · Jun 10, 2023 · Updated 2 years ago
- The Kinyarwanda model for DeepSpeech ☆17 · May 11, 2021 · Updated 5 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Mar 18, 2019 · Updated 7 years ago
- Deep learning for Thai romanization. ☆14 · Jul 30, 2022 · Updated 3 years ago
- Group coding repository of PltCov, a tool to instrument ELF binaries for fuzzing with ngram coverage of imported APIs ☆12 · Jan 18, 2022 · Updated 4 years ago
- DEPRECATED - Archived. Formerly a meta repository for all DinoPark issues ☆18 · May 14, 2019 · Updated 7 years ago
- Agent toolkit for 100 hours of speech and 10 GiB of text ☆14 · Jul 15, 2025 · Updated 10 months ago
- ☆42 · May 4, 2024 · Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning ☆41 · Feb 4, 2026 · Updated 3 months ago
- ☆40 · Feb 1, 2023 · Updated 3 years ago
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine. ☆14 · May 9, 2023 · Updated 3 years ago
- Web-based JavaScript for tokenizing Thai words ☆16 · Nov 5, 2021 · Updated 4 years ago
- Scripts for working with open.bible data ☆26 · Jan 24, 2022 · Updated 4 years ago
- ☆17 · May 6, 2022 · Updated 4 years ago
- ICML 2019. Turn a pre-trained GAN model into a content-addressable model without retraining. ☆21 · Jul 25, 2024 · Updated last year
- ☆13 · Dec 15, 2022 · Updated 3 years ago
- The official repo for the QuickStatements PHP/HTML/JS interface ☆53 · Apr 7, 2026 · Updated last month
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Mar 6, 2023 · Updated 3 years ago
- A component to consume with many threads from Kafka ☆12 · Jul 6, 2023 · Updated 2 years ago
- Coqui Inference Engine ☆41 · Aug 3, 2021 · Updated 4 years ago
- opennlp-solr-examples ☆10 · Jul 1, 2022 · Updated 3 years ago
- Automatic Speech Recognition (ASR) - Kabyle ☆18 · Nov 28, 2020 · Updated 5 years ago
- OSM vector tiles on IPFS ☆29 · Jan 18, 2017 · Updated 9 years ago
- Dependency parser for the Thai language ☆27 · Jan 25, 2025 · Updated last year
- A voice driven 3D chess game for learning Voice AI ☆17 · Jul 6, 2022 · Updated 3 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data". ☆11 · Sep 18, 2023 · Updated 2 years ago
- Data from the Sequoia treebank. ☆11 · May 6, 2026 · Updated 3 weeks ago
- swagger-service tutorial using Duct ☆13 · Aug 30, 2017 · Updated 8 years ago