ContinuumIO / nutchpy
For interacting with nutch via Python
☆29Updated 2 months ago
Alternatives and similar repositories for nutchpy
Users that are interested in nutchpy are comparing it to the libraries listed below
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆37Updated last year
- Mirror of Apache sdap (Incubating)☆11Updated last year
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Updated 9 years ago
- Nutch-Python is a Python binding to the Apache Nutch™ REST services allowing Nutch to be called natively in the Python community.☆39Updated 9 years ago
- Vizlinc☆15Updated 9 years ago
- ☆21Updated 9 years ago
- Hadoop MapReduce over Hive based implementation of attributed network pattern matching.☆40Updated 10 years ago
- Topic modeling web application☆41Updated 9 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 7 years ago
- A toolkit for clustering web pages based on various similarity measures.☆33Updated 3 years ago
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.☆23Updated 9 years ago
- Linking DBpedia to SciGraph☆14Updated 7 years ago
- ☆20Updated 8 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- Library for building reproducible data pipelines to support experimentation☆20Updated 9 years ago
- MITIE: library and tools for information extraction☆29Updated 10 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Saul : Declarative Learning-Based Programming☆64Updated 5 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 6 years ago
- Run IPython, Pattern, NLTK, Pandas, NumPy, SciPy, Numba, Biopython inside Docker☆47Updated 10 years ago
- Operations for Immutable Notebook Documents☆29Updated 8 years ago
- ☆13Updated 10 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago
- Scraper built with Scrapy.☆18Updated 10 months ago
- Default Repo description from terraform module☆5Updated 5 years ago
- utilities for filesystem exploration and automated builds☆21Updated this week
- stav text annotation visualiser☆34Updated 13 years ago
- RESTful API around the PETRARCH coding software☆10Updated 4 years ago