errollw / gengram
Lightweight ngram random text generator
☆11Updated 11 years ago
Alternatives and similar repositories for gengram
Users that are interested in gengram are comparing it to the libraries listed below
- Python tool for normalizing text and text canonicalization (DISCONTINUED)☆41Updated 12 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 3 years ago
- NLTK Contrib☆168Updated last year
- maximum entropy based part-of-speech tagger for NLTK☆45Updated 9 years ago
- The Cantonese Wordnet☆14Updated 2 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 10 years ago
- CS224S Course Project☆14Updated 11 years ago
- A Python interface to OpenFst☆88Updated 6 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 3 years ago
- Collects all tweets from the sample Public stream using Twitter's streaming API, and saves them to a file for later use as a corpus.☆45Updated 5 years ago
- ☆16Updated 6 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 10 months ago
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- Attentional Neural Network that translates text to phones.☆11Updated 7 years ago
- project trying to replicate http://arxiv.org/pdf/1412.5567v2.pdf☆12Updated 10 years ago
- Python library for n-gram models in ARPA format☆40Updated 3 years ago
- A Python package to facilitate research on building and evaluating automated scoring models.☆71Updated 11 months ago
- Fast Word Clustering Software☆79Updated 10 months ago
- Grapheme to phoneme converter for Estonian☆14Updated 4 years ago
- Identification and conversion functions for Chinese text processing☆66Updated last year
- Labeled data for homograph disambiguation☆62Updated 2 years ago
- Tools for working with the CMU Pronunciation Dictionary☆36Updated 8 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Updated 8 years ago
- pronunciation dictionaries for multiple languages☆91Updated 8 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆40Updated 3 years ago
- A program to correct non-word spelling error in sentences using ngram MAP Language Models, Noisy Channel Model, Error Confusion Matrix an…☆54Updated 5 years ago
- Dialect identification using Siamese network☆15Updated 8 years ago
- bilingual dictionary extractor from parallel corpora☆22Updated 11 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆20Updated 6 years ago
- Python module for syllabifying English ARPABET transcriptions☆71Updated 6 years ago