smartive / docker-nutch-elasticsearch-mongodb
Docker Image for Apache Nutch, Elasticsearch and MongoDB
☆8 · Updated 7 years ago
Alternatives and similar repositories for docker-nutch-elasticsearch-mongodb:
Users who are interested in docker-nutch-elasticsearch-mongodb are comparing it to the repositories listed below.
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem ☆12 · Updated this week
- An Apache Lucene TokenFilter that uses word2vec vectors for term expansion. ☆24 · Updated 10 years ago
- ☆16 · Updated 8 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch ☆49 · Updated 9 years ago
- DKPro WSD: A Java framework for word sense disambiguation ☆20 · Updated 2 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn ☆57 · Updated 11 years ago
- D3 and Play based visualization for entity-relation graphs, especially for NLP and information extraction ☆29 · Updated 9 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format) ☆64 · Updated 8 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ☆85 · Updated 3 years ago
- Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction, etc. ☆27 · Updated 10 years ago
- TextFlows is an open-source online platform for composition, execution, and sharing of interactive text mining and natural language proce… ☆19 · Updated 7 years ago
- Recommendations Serving Engine using python ☆28 · Updated 9 years ago
- Neural Elastic Inference and Search ☆19 · Updated 5 years ago
- Interpretable feature construction from taxonomies for text classification ☆18 · Updated 2 years ago
- Code examples and data for the KiwiPyCon 2014 NLP tutorial ☆39 · Updated 10 years ago
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project. ☆9 · Updated 8 years ago
- Code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy" ☆15 · Updated 8 years ago
- Contains data, format checker, scorer and baselines for the CLEF2018 Fact Checking Lab. ☆23 · Updated 5 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings) ☆17 · Updated 10 years ago
- Mention-anomaly-based event detection and tracking in Twitter ☆17 · Updated 8 years ago
- A workflow system for Natural Language Processing. ☆21 · Updated 5 years ago
- A toolkit for generating paraphrase vector representations for words in context ☆23 · Updated 9 years ago
- Expletives vomiting library... ☆13 · Updated 7 years ago
- Python bindings for Neo4j ☆26 · Updated 10 years ago
- A set of tools for performing Labeled Latent Dirichlet Allocation on textual datasets, with an emphasis on Twitter profiles. Contains too… ☆42 · Updated 3 years ago
- Spark NLP for Streamlit ☆15 · Updated 3 years ago
- Deep learning spelling patterns with a recurrent neural network ☆12 · Updated 7 years ago
- Java library for Concrete, a data serialization format for NLP ☆6 · Updated 5 years ago
- Text processing library for sentiment analysis and related tasks ☆27 · Updated 6 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆65 · Updated 8 years ago