pranab / visitante
Set of Hadoop, Spark and Storm based tools for web and customer analytic
☆34Updated 3 years ago
Alternatives and similar repositories for visitante:
Users that are interested in visitante are comparing it to the libraries listed below
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 4 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- Katta - distributed Lucene☆60Updated 11 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- ☆40Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- The Chronix storage based on Apache Lucene☆47Updated 7 years ago
- distributed realtime searchable database☆116Updated 10 years ago
- Mahout vector encoding for pig☆54Updated 2 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Lucene based indexing in Cassandra☆61Updated 8 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆76Updated 10 years ago
- Cascading on Apache Flink®☆54Updated last year
- Pig on Apache Spark☆83Updated 9 years ago
- real time log event processing using spark, kafka & cassandra☆13Updated 10 years ago
- ☆24Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Deprecated - Check out MemSQL Pipelines instead!☆8Updated 7 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆25Updated 9 years ago
- Sample migration from Titan 0.5.4 to Titan 1.0.0☆17Updated 9 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Presto connector to Amazon Kinesis service.☆14Updated 5 years ago
- Templates for projects based on top of H2O.☆37Updated 3 months ago
- Feature rich service discovery on ZooKeeper☆29Updated 2 years ago
- Apache Spark jobs such as Principal Coordinate Analysis.☆74Updated 8 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 2 weeks ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆77Updated 13 years ago