Tool to collect and review sentences for Common Voice
☆82 May 10, 2023 Updated 2 years ago
Alternatives and similar repositories for sentence-collector
Users who are interested in sentence-collector are comparing it to the repositories listed below
- Scraping Wikipedia for fair use sentences ☆54 Jan 25, 2024 Updated 2 years ago
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language ☆11 Apr 13, 2023 Updated 2 years ago
- A public repository for corrupt0 datathon's court data ☆11 Jul 6, 2019 Updated 6 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 May 4, 2022 Updated 3 years ago
- Tooling for producing French dataset for Common Voice ☆101 Jan 20, 2025 Updated last year
- opennlp-solr-examples ☆10 Jul 1, 2022 Updated 3 years ago
- A companion repository to the "You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source"… ☆20 Oct 14, 2022 Updated 3 years ago
- Shan Natural Language Processing tools inspired by PyThaiNLP ☆14 Mar 1, 2026 Updated last week
- Thai PDPA Website (Unofficial) ☆11 Jun 10, 2023 Updated 2 years ago
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine. ☆14 May 9, 2023 Updated 2 years ago
- The Kinyarwanda model for DeepSpeech ☆17 May 11, 2021 Updated 4 years ago
- Agent toolkit for 100 hours of speech and 10 GiB of text ☆14 Jul 15, 2025 Updated 7 months ago
- A vocabulary that describes the basic elements of location information, such as geometries and addresses. ☆17 Sep 2, 2025 Updated 6 months ago
- ☆40 Feb 1, 2023 Updated 3 years ago
- ☆42 May 4, 2024 Updated last year
- ☆20 Jul 22, 2022 Updated 3 years ago
- ☆17 May 6, 2022 Updated 3 years ago
- MODS and MADS data for the Perseus Catalog ☆15 Dec 19, 2025 Updated 2 months ago
- Thai smart home corpus with "Gowajee" hotword ☆18 Jul 30, 2023 Updated 2 years ago
- A web-based JavaScript tool for tokenizing Thai words ☆16 Nov 5, 2021 Updated 4 years ago
- A very basic demonstration connecting speech recognition and text-to-speech ☆20 May 3, 2020 Updated 5 years ago
- Automatic Speech Recognition (ASR) - Kabyle ☆18 Nov 28, 2020 Updated 5 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆26 Mar 24, 2023 Updated 2 years ago
- All you need to get started for the Zero Speech Challenge 2017 ☆47 Apr 23, 2019 Updated 6 years ago
- Scripts for working with open.bible data ☆26 Jan 24, 2022 Updated 4 years ago
- The official repo for the QuickStatements PHP/HTML/JS interface ☆51 Jul 31, 2025 Updated 7 months ago
- ICML 2019. Turn a pre-trained GAN model into a content-addressable model without retraining. ☆21 Jul 25, 2024 Updated last year
- A crash course for training speech recognition models using DeepSpeech. ☆24 May 16, 2021 Updated 4 years ago
- Yaitron English-Thai and Thai-English dictionary ☆34 Oct 13, 2020 Updated 5 years ago
- Filtering and Noise Adding Tool ☆29 May 27, 2022 Updated 3 years ago
- Explainable AI for Software Engineering: A Hands-on Guide on How to Make Software Analytics More Practical, Explainable, and Actionable (… ☆27 Nov 14, 2021 Updated 4 years ago
- ☆33 Jul 13, 2024 Updated last year
- An R package for Poisson multivariate adaptive shrinkage. ☆11 Apr 15, 2024 Updated last year
- Code and data for Teddy https://arxiv.org/abs/2001.05171. ☆15 Jun 21, 2022 Updated 3 years ago
- ☆10 May 25, 2021 Updated 4 years ago
- CRF syllable segmenter for Thai ☆27 May 3, 2024 Updated last year
- A Dataset for Thai Text Summarization with over 310K articles. ☆29 Feb 4, 2023 Updated 3 years ago
- Some tutorials used for ASR class ☆31 Jul 20, 2021 Updated 4 years ago
- Copy favorite and commonly used RDF schemas/ontologies to a safe place ☆37 May 24, 2019 Updated 6 years ago