rflynn / spill-chick
Probabilistic language corrector based on Google n-grams
☆21Updated 14 years ago
Alternatives and similar repositories for spill-chick
Users that are interested in spill-chick are comparing it to the libraries listed below
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 3 years ago
- Fast Word Clustering Software☆79Updated 11 months ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- Labeled examples from wiki dumps in Python☆67Updated 9 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆101Updated 10 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 10 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 8 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- Server/Client around Spacy to load spacy only once☆46Updated 7 years ago
- ☆62Updated 11 years ago
- Tweets annotated with coarse-grained sense labels (supersenses)☆13Updated 11 years ago
- Framework for evaluating text extraction algorithms implemented as web services☆42Updated 13 years ago
- Shell scripts to assist downloading & processing the Google n-grams corpora☆14Updated 8 years ago
- Implicit relation extractor using a natural language model.☆24Updated 7 years ago
- Entity Linking for the masses☆56Updated 10 years ago
- Fast structured perceptron sequential labeler☆15Updated 10 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆91Updated 7 years ago
- Parallel Semi-Supervised Latent Dirichlet Allocation☆33Updated 3 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet…☆31Updated last year
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 13 years ago
- Knowledge extraction from web data☆92Updated 7 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Model Training tool for MITIE☆79Updated 10 years ago
- N3 - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format☆71Updated 8 years ago
- Stand-alone service for fuzzy lookup of string labels of resources☆17Updated 8 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆155Updated last year
- ☆41Updated 9 years ago
- Semantic embeddings of entities☆66Updated 9 years ago
- Transform MCR 3.0 data to read with nltk WordNet reader. Use this to load WordNet in Spanish, among other languages, from nltk.☆25Updated 3 years ago