common-voice/cv-sentence-extractor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/common-voice/cv-sentence-extractor)

common-voice / cv-sentence-extractor

Scraping Wikipedia for fair use sentences

☆54

Alternatives and similar repositories for cv-sentence-extractor

Users that are interested in cv-sentence-extractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

common-voice / sentence-collector
View on GitHub
Tool to collect and review sentences for Common Voice
☆83May 10, 2023Updated 3 years ago
common-voice / common-voice-bundler
View on GitHub
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
☆11Apr 13, 2023Updated 3 years ago
mayukhnair / deepspeech-colab
View on GitHub
Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory
☆16Mar 18, 2019Updated 7 years ago
cv-project-app / common-voice-app
View on GitHub
Repository of "CV Project" app. It's an unofficial app for Mozilla Common Voice, which permits you to contribute to this project via your…
☆114Jul 8, 2026Updated last week
JRMeyer / common-voice-stats
View on GitHub
A living document for all things Common Voice.
☆14Jun 24, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
common-voice / commonvoice-fr
View on GitHub
Tooling for producing French dataset for Common Voice
☆101Jan 20, 2025Updated last year
common-voice / community-playbook
View on GitHub
Mozilla Voice Community Playbook
☆48May 21, 2024Updated 2 years ago
coqui-ai / stt-model-manager
View on GitHub
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
☆26Mar 24, 2023Updated 3 years ago
asafu-art / deepspeech-kabyle
View on GitHub
Automatic Speech Recognition (ASR) - Kabyle
☆18Nov 28, 2020Updated 5 years ago
ftyers / commonvoice-utils
View on GitHub
Linguistic processing for Common Voice
☆59Jan 18, 2024Updated 2 years ago
zkmkarlsruhe / language-identification
View on GitHub
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
☆42Feb 4, 2026Updated 5 months ago
Tatoeba / imouto
View on GitHub
Administrative tools for the Tatoeba website
☆16May 2, 2021Updated 5 years ago
dabinat / deepspeech-tools
View on GitHub
Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14Aug 6, 2019Updated 6 years ago
LibreTranslate / LexiLang
View on GitHub
Simple, fast dictionary-based language detector for short texts.
☆22Feb 5, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FrancescoBia / draftify
View on GitHub
Draftify is a simple note app to write passing thought and ideas without distractions.
☆14Mar 9, 2023Updated 3 years ago
AccelerateNetworks / DeepSpeech_Frontend
View on GitHub
A webpage and API for using Mozilla DeepSpeech
☆48Feb 24, 2021Updated 5 years ago
MohammedBelkacem / corpus-kab
View on GitHub
Tuddar, ismawen d imeḍqan
☆11Jan 3, 2020Updated 6 years ago
gweltou / anaouder-cli
View on GitHub
Anaouder mouezh e Brezhoneg gant Vosk
☆15Nov 24, 2025Updated 7 months ago
wombats-writing-code / crosslinks-js
View on GitHub
Linking topics and learning resources
☆10Jun 30, 2016Updated 10 years ago
MozillaItalia / DeepSpeech-Italian-Model
View on GitHub
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
☆95Mar 15, 2022Updated 4 years ago
coqui-ai / inference-engine
View on GitHub
Coqui Inference Engine
☆41Aug 3, 2021Updated 4 years ago
lingua-libre / RecordWizard
View on GitHub
🌻 MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files.
☆17Updated this week
HarikalarKutusu / 3d-voice-chess
View on GitHub
A voice driven 3D chess game for learning Voice AI
☆17Jul 6, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
mozilla / community-portal
View on GitHub
☆42Jul 28, 2025Updated 11 months ago
MohammedBelkacem / KabyleNLP
View on GitHub
Natural language processing for the kabyle language
☆16Jul 3, 2020Updated 6 years ago
MainRo / docker-deepspeech-server
View on GitHub
A dockerfile to run deepspeech-server
☆30Aug 25, 2020Updated 5 years ago
ccoreilly / LocalSTT
View on GitHub
Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
☆111Jan 19, 2022Updated 4 years ago
linuxmint / mint-translations
View on GitHub
☆12Jan 8, 2026Updated 6 months ago
hakimel / reveal.js-plugins
View on GitHub
Plugins for reveal.js
☆12Jun 1, 2020Updated 6 years ago
RobinvanderVliet / Telegramo.org
View on GitHub
Ĉi tiu deponejo enhavas la fontokodon de la retejo Telegramo.org. / This repository contains the source code of the website Telegramo.org…
☆32Nov 4, 2019Updated 6 years ago
opendata-guru / data-portallist-de
View on GitHub
📚 list of all open data portals in Germany 🇩🇪
☆12Feb 6, 2023Updated 3 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
mozilla / voice-corpus-tool
View on GitHub
Tool for creation, manipulation and maintenance of voice corpora
☆82May 3, 2024Updated 2 years ago
parolteknologio / stt-esperanto
View on GitHub
Deepspeech/Coqui AI speech to text systems in Esperanto. - Parolrekoniloj en Esperanto uzante Deepspeech/Coqui Ai.
☆10Jan 11, 2022Updated 4 years ago
opening-hours / opening_hours_map
View on GitHub
Map which evaluates opening_hours related tags.
☆17Jun 17, 2026Updated last month
Keyaku / bouncy
View on GitHub
Game for Godot demonstrating OpenCV calls through GDNative
☆20May 16, 2021Updated 5 years ago
common-voice / common-voice
View on GitHub
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
☆3,471Updated this week
mozilla-extensions / firefox-voice
View on GitHub
Firefox Voice is an experiment in a voice-controlled web user agent
☆292Jan 29, 2021Updated 5 years ago
awni / py-arpa-lm
View on GitHub
Python API for reading and querying ARPA formatted language models.
☆33Sep 9, 2014Updated 11 years ago