skalmadka / web-crawler
Distributed Web Crawler, Parser and Search Engine.
☆10Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for web-crawler
- Collects multimedia content shared through social networks.☆19Updated 9 years ago
- Focused Crawler for VT's CTRNet☆10Updated 11 years ago
- Exploration Library in Java☆12Updated last year
- Nutch 2.3.1 plugin for whitelisting/blacklisting specific HTML elements☆13Updated 2 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Code for the CIKM 2013 paper "Discovering Coherent Topics Using General Knowledge"☆11Updated 10 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆16Updated 8 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 9 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Real-time query spark and visualise it as graph.☆24Updated 7 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 6 years ago
- Machine Learning Open Source Software☆23Updated 6 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Updated 8 years ago
- Allows a Storm topology to consume an AMQP exchange as an input source.☆59Updated 11 years ago
- Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that…☆32Updated 9 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 4 years ago
- Antelope Realtime Events framework for feature engineering in agile machine learning environments.☆26Updated 9 years ago
- Templates for projects based on top of H2O.☆37Updated 3 weeks ago
- A Java framework to build semantics-aware autoencoder neural network from a knowledge-graph.☆13Updated 7 years ago
- VoltDB Click Stream Processing Example.☆16Updated 6 years ago
- Example programs, data, and jarfiles from book "Text Processing in Java"☆19Updated 10 years ago
- Tools to evaluate accuracies of various (research papers') metadata extraction libraries☆11Updated 8 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 5 years ago
- Vizlinc☆14Updated 8 years ago
- Sparking Using Java8☆17Updated 9 years ago
- ☆20Updated 7 years ago
- k-means + a linear model = good results☆55Updated 10 years ago
- Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimiza…☆35Updated 4 years ago