skalmadka / web-crawler
Distributed Web Crawler, Parser and Search Engine.
☆10 Updated 8 years ago
Alternatives and similar repositories for web-crawler:
Users who are interested in web-crawler compare it to the libraries listed below.
- This project contains the code to translate between Apache Spark and SFrame. ☆21 Updated 8 years ago
- word2vec-java ☆7 Updated 4 months ago
- Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimiza… ☆35 Updated 4 years ago
- scalding powered machine learning ☆109 Updated 10 years ago
- Deep learning certificate part 1 ☆10 Updated 2 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 10 years ago
- Allows a Storm topology to consume an AMQP exchange as an input source. ☆59 Updated 11 years ago
- MIT Big Data Challenge ☆14 Updated 10 years ago
- Tweet Analysis with Spark ☆15 Updated 7 years ago
- A chef cookbook for deploying spark ☆30 Updated 11 years ago
- Sparking Using Java8 ☆17 Updated 9 years ago
- big data books and papers ☆30 Updated 11 years ago
- Templates for projects based on top of H2O. ☆37 Updated 3 months ago
- Code for the CIKM 2013 paper "Discovering Coherent Topics Using General Knowledge" ☆11 Updated 10 years ago
- Implementation of the Chinese Whispers graph clustering algorithm ☆8 Updated 7 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP ☆15 Updated 8 years ago
- Neural Network engine for Veles distributed machine learning platform ☆26 Updated 8 years ago
- Focused Crawler for VT's CTRNet ☆10 Updated 11 years ago
- Movielens collaborative filtering with Solr streaming expression ☆11 Updated 8 years ago
- Recommendations Serving Engine using python ☆28 Updated 9 years ago
- Includes Code for Inference and Evaluation of Topic Models for Selectional Preferences ☆16 Updated last year
- Sample custom Nifi processor to process tcpdump ☆18 Updated 9 years ago
- NLP Utilities in Java ☆43 Updated 2 years ago
- iCQA - Intelligent Community Question Answering Framework ☆31 Updated 8 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,... ☆34 Updated 6 years ago
- Distributed optimization framework with parameter server ☆23 Updated 9 years ago
- A collection of efficient utilities for a data scientist. ☆41 Updated 9 years ago