skalmadka / web-crawler
Distributed Web Crawler, Parser and Search Engine.
☆10Updated 8 years ago
Alternatives and similar repositories for web-crawler:
Users that are interested in web-crawler are comparing it to the libraries listed below
- scalding powered machine learning☆109Updated 10 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- Templates for projects based on top of H2O.☆37Updated last week
- Storm / Solr Integration☆19Updated last year
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 9 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 12 years ago
- faceted search engine☆41Updated 10 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- word2vec-java☆7Updated 5 months ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- PredictionIO word2vec engine template (Scala-based parallelized engine)☆12Updated 9 years ago
- Library to use Kafka as a spout within Storm☆43Updated 13 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- Vizlinc☆14Updated 9 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Deep learning certificate part 1☆10Updated 2 years ago
- A fork of cascading patterns, but implemented for trident☆71Updated last year
- Real-time query spark and visualise it as graph.☆24Updated 7 years ago
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 3 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird☆27Updated 6 years ago