erosenfeld / Thesaurus
A dynamically generated thesaurus using Syntactic N-grams parsed by Google Research. Rather than providing synonyms, this thesaurus provides words used in similar contexts. It also provides actual values, so certainty of similarity can be properly gauged.
☆15Updated 11 years ago
Alternatives and similar repositories for Thesaurus:
Users that are interested in Thesaurus are comparing it to the libraries listed below
- Client for Stanford Named Entity Reconginiton☆27Updated 6 years ago
- A tool for calculation semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆27Updated 9 years ago
- Analysis plugin for ElasticSearch providing capability for processing inline annotations in documents.☆35Updated 11 years ago
- The Community-enRiched Open WordNet (CROWN)☆18Updated 9 years ago
- js utility for summarizing large bodies of text using a basic sentence relevance ranking algorithm☆100Updated 9 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- Performs multi document summarization. Includes a method to generate summaries: The method uses a sentence importance score calculator ba…☆37Updated 11 years ago
- Machine translation for the real world☆23Updated 5 years ago
- This repository contains a resurrected and repaired version of OpenEphyra, from https://mu.lti.cs.cmu.edu/trac/Ephyra/wiki/OpenEphyra.☆123Updated 5 years ago
- Stanford Pattern-based Information Extraction and Diagnostics -- Visualization☆93Updated 10 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated last month
- NLP tools developed by Emory University.☆60Updated 8 years ago
- A client for the Stanford Part of Speech Tagger XMLRPC server.☆72Updated 7 years ago
- The linked open dataset described at http://datahub.io/dataset/vu-wordnet, and the tools used to create it☆25Updated 4 years ago
- An offline/online field database which adapts to its user's terminology and I-Language. http://fielddb.github.io☆79Updated 2 years ago
- A parts-of-speech chunker.☆30Updated 9 years ago
- English Dependency Relationship Extractor☆85Updated 3 months ago
- My Part of Speech Tagger☆42Updated 8 years ago
- SKOS analysis for Elasticsearch☆54Updated 8 years ago
- Software and resources for natural language processing.☆131Updated 8 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- Multilingual automatic text summarizer using statistical approach and extraction☆34Updated 5 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆111Updated this week
- Nodejs wrapper for Stanford Classifier.☆47Updated 4 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies…☆91Updated 6 years ago
- A Javascript Implementation of the Porter Stemmer☆96Updated 3 years ago
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.…☆53Updated 8 years ago
- Shell scripts to assist downloading & processing the Google n-grams corpora☆14Updated 7 years ago
- Wikipedia Tools for Google Spreadsheets — Install:☆149Updated 6 months ago
- http://www.ark.cs.cmu.edu/ARKref/☆32Updated 10 years ago