ContinuumIO / nutchpyLinks
For interacting with nutch via Python
☆30Updated last month
Alternatives and similar repositories for nutchpy
Users that are interested in nutchpy are comparing it to the libraries listed below
Sorting:
- Nutch-Python is a Python binding to the Apache Nutch™ REST services allowing Nutch to be called natively in the Python community. — Edit☆39Updated 9 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Updated last year
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Topic modeling web application☆40Updated 10 years ago
- stav text annotation visualiser☆34Updated 13 years ago
- ☆21Updated 9 years ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 8 years ago
- ☆92Updated 9 years ago
- SociaLite: query language for large-scale graph analysis and data mining☆110Updated 9 years ago
- Mirror of Apache Stanbol (incubating)☆114Updated last year
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- Vizlinc☆15Updated 9 years ago
- Unified interface for local and distributed ndarrays☆157Updated 6 years ago
- Warcbase is an open-source platform for managing analyzing web archives☆162Updated 7 years ago
- A set of benchmark problems and implementations for Python☆64Updated 2 years ago
- ☆14Updated 4 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- A system for connecting language to space and time.☆64Updated 4 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- Ranking Entity Types using the Web of Data☆30Updated 8 years ago
- Scientific Spark - a NASA AIST14 project☆86Updated 7 years ago
- A Python wrapper over the GraphGen system☆37Updated 7 years ago
- NLP toolkit (tokenizer, POS-tagger, parser, etc.)☆43Updated 8 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 9 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- Looking at big data? Add a little salt.☆59Updated 2 years ago
- Tools for building a Lucene index for Semantic Vectors☆21Updated 10 years ago