AutoCorpus is a set of utilities for automatically extracting language corpora and language models from publicly available datasets. AutoCorpus utilities follow the Unix design philosophy and integrate easily into custom data processing pipelines.
☆37 · Feb 1, 2012 · Updated 14 years ago
Alternatives and similar repositories for AutoCorpus
Users that are interested in AutoCorpus are comparing it to the libraries listed below.
- natural language processing with link-grammar ☆17 · Sep 30, 2009 · Updated 16 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Mar 6, 2023 · Updated 3 years ago
- This is application for dysarthria to improve their pronunciation by using deep learning ☆10 · Dec 29, 2020 · Updated 5 years ago
- Coqui STT (🐸STT) based forced alignment tool ☆13 · Feb 24, 2022 · Updated 4 years ago
- ☆13 · Updated this week
- ☆18 · Apr 28, 2021 · Updated 5 years ago
- my internet website and web blog ☆17 · Jul 18, 2025 · Updated 11 months ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 3 years ago
- Speech Processing & Linguistic Analysis Tool ☆11 · Jun 30, 2019 · Updated 7 years ago
- Grapheme to phoneme toolkit using joint-modelling + CRFs in java ☆15 · Jul 14, 2018 · Updated 7 years ago
- Tools for working with the CMU Pronunciation Dictionary ☆36 · Sep 5, 2017 · Updated 8 years ago
- Simple script that crawls the Android Marketplace ☆36 · Jan 15, 2016 · Updated 10 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆21 · Jan 24, 2022 · Updated 4 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆15 · Jan 24, 2017 · Updated 9 years ago
- wake word spotting with kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 · May 19, 2020 · Updated 6 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Feb 15, 2024 · Updated 2 years ago
- An app that graphs and compares the pitch contours of spoken language, to help language learners perfect their intonation (Hackbright Spr… ☆31 · Jul 20, 2017 · Updated 8 years ago
- An Online Logic Assistant Based on Coq ☆25 · Feb 15, 2012 · Updated 14 years ago
- BurrMill core ☆22 · Nov 2, 2021 · Updated 4 years ago
- SVM Classifier to Detect Sentiment of Tweets ☆16 · Apr 20, 2015 · Updated 11 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi. ☆22 · Jul 12, 2019 · Updated 6 years ago
- ☆22 · Jul 8, 2021 · Updated 4 years ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆25 · Dec 21, 2023 · Updated 2 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas… ☆12 · Apr 10, 2014 · Updated 12 years ago
- small python app to help practice speech shadowing, helpful for language learning ☆16 · Jun 25, 2020 · Updated 6 years ago
- Java interfaces and tools for Kaldi speech recognition. ☆20 · Oct 2, 2016 · Updated 9 years ago
- Phonetic and phonological vocoding platform ☆17 · Nov 23, 2016 · Updated 9 years ago
- ☆25 · Jun 14, 2022 · Updated 4 years ago
- A sdk for AlchemyAPI in Javascript - Please note that this legacy AlchemyAPI SDK is no longer supported by IBM. Please use the Watson SDK… ☆65 · Sep 28, 2016 · Updated 9 years ago
- Audio Diarization Annotation tool ☆30 · Nov 8, 2019 · Updated 6 years ago
- Rails Application Demonstrating IronWorker Usage ☆25 · Nov 18, 2022 · Updated 3 years ago
- Social media crawling engine implemented totally browser side in JS ☆98 · Jul 10, 2016 · Updated 9 years ago
- A treemap implementation using canvas (initial stage) ☆14 · Mar 14, 2011 · Updated 15 years ago
- Simple rules based grapheme to phoneme in Python ☆11 · Sep 2, 2017 · Updated 8 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm) ☆27 · Mar 1, 2017 · Updated 9 years ago
- a javascript source code formatter ☆32 · Oct 19, 2010 · Updated 15 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- MATLAB functions that interface with the HTK Speech Recognition Toolkit (http://htk.eng.cam.ac.uk/) for training HMMs, GMMs and simple sp… ☆46 · Jan 4, 2017 · Updated 9 years ago