turian / kea-service
KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service
☆42 · Updated 14 years ago
Alternatives and similar repositories for kea-service
Users who are interested in kea-service are comparing it to the libraries listed below.
- Updates to Zope's keyphrase extractor (forked from 1.1.0) ☆67 · Updated 8 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps. ☆161 · Updated 3 years ago
- Lightweight, multilingual natural language processing ☆63 · Updated 12 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool ☆149 · Updated 4 years ago
- Natural language Understanding Toolkit ☆119 · Updated 11 years ago
- Pretty fast parser for probabilistic context free grammars ☆88 · Updated 12 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible. ☆52 · Updated 8 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles. ☆38 · Updated 7 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources ☆101 · Updated 10 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD] ☆107 · Updated 12 years ago
- This is a fork of the Stanford Named Entity Recognizer with added support for deploying in Java servlet mode. See github.com/dat/pyner fo… ☆91 · Updated 13 years ago
- Analysis plugin for ElasticSearch providing capability for processing inline annotations in documents. ☆35 · Updated 11 years ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices. ☆78 · Updated 8 years ago
- Topic modeling web application ☆40 · Updated 10 years ago
- Common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text ☆35 · Updated 9 years ago
- A visualizer for multi-dimensional semantic data ☆38 · Updated 14 years ago
- Text classification using Naive Bayes and Elasticsearch ☆152 · Updated 9 years ago
- Jeremy's Machine Learning Library ☆52 · Updated 9 years ago
- ☆20 · Updated 8 years ago
- ☆44 · Updated 10 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data. ☆49 · Updated 8 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet… ☆31 · Updated last year
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 · Updated 9 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample; ☆193 · Updated 11 years ago
- Keeps a mirror of DBpedia live in sync ☆27 · Updated 4 years ago
- Common Crawl support library to access 2008-2012 crawl archives (ARC files) ☆505 · Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc. ☆59 · Updated 4 years ago
- A Graph Server (no longer active - see Apache TinkerPop) ☆431 · Updated 2 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop. ☆283 · Updated 7 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 · Updated 4 years ago