pranab / visitante
Set of Hadoop, Spark and Storm based tools for web and customer analytic
☆34Updated 3 years ago
Alternatives and similar repositories for visitante:
Users that are interested in visitante are comparing it to the libraries listed below
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Katta - distributed Lucene☆60Updated 12 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 4 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Cascading on Apache Flink®☆54Updated last year
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- A fork of cascading patterns, but implemented for trident☆71Updated last year
- chef cookbook to install Apache Spark☆10Updated 9 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆75Updated 11 years ago
- Analyzing Twitter real time feed with Spark Streaming☆32Updated 10 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Few things we've met during our etl project based on spark☆24Updated 7 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Updated 14 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 3 months ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆25Updated 9 years ago
- Distributed Elastic Message Processing System☆195Updated last year
- ☆24Updated 10 years ago
- A shim for using Cassandra as a backend for OpenTSDB. Not to be used as a general Cassandra client.☆7Updated 6 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- A real time streaming implementation of markov chain based fraud detection☆23Updated 10 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- Sparse feature extraction with Spark☆30Updated 6 years ago
- ☆33Updated 9 years ago
- The Chronix storage based on Apache Lucene☆47Updated 7 years ago