intel-spark / InformationExtraction
☆20Updated 8 years ago
Alternatives and similar repositories for InformationExtraction
Users that are interested in InformationExtraction are comparing it to the libraries listed below
- NLP toolkit (tokenizer, POS-tagger, parser, etc.)☆43Updated 8 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 6 years ago
- Question Answering via Integer Programming (TableILP)☆28Updated 9 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Topic Modeling on Apache Spark☆94Updated 6 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 6 years ago
- Splash Project for parallel stochastic learning☆94Updated 8 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- Interactive book on Statistical NLP☆32Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- An Apache Lucene TokenFilter that uses word2vec vectors for term expansion.☆24Updated 11 years ago
- Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark.☆105Updated 7 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies…☆95Updated 7 years ago
- An autoencoder to calculate word embeddings as mentioned in Lebret/Collobert paper 2015☆74Updated 8 years ago
- NER using CRF++☆10Updated 10 years ago
- Replication software, data, and supplementary materials for the paper: O'Connor, Stewart and Smith, ACL-2013, "Learning to Extract Intern…☆26Updated 4 years ago
- ☆20Updated 4 years ago
- NLP tools developed by Emory University.☆61Updated 9 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- A set of methods that predict the future values of popularity indices for news posts using a variety of features.☆33Updated 7 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 10 years ago