skalmadka / web-crawlerLinks
Distributed Web Crawler, Parser and Search Engine.
☆10Updated 9 years ago
Alternatives and similar repositories for web-crawler
Users that are interested in web-crawler are comparing it to the libraries listed below
Sorting:
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 10 years ago
 - Code for the CIKM 2013 paper "Discovering Coherent Topics Using General Knowledge"☆11Updated 11 years ago
 - Movielens collaborative filtering with Solr streaming expression☆11Updated 9 years ago
 - Collects multimedia content shared through social networks.☆19Updated 10 years ago
 - NLP Utilities in Java☆43Updated 2 years ago
 - Script to perform dictionary based n-gram text tagging efficiently in apache spark☆11Updated 9 years ago
 - ☆20Updated 8 years ago
 - A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Updated 9 years ago
 - Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that…☆32Updated 10 years ago
 - Distributed implementation of Robust PLSA using Spark☆12Updated 4 years ago
 - Focused Crawler for VT's CTRNet☆10Updated 12 years ago
 - Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
 - t test