gwik / spark-cookbook
chef cookbook to install Apache Spark
☆10Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for spark-cookbook
- Sparking Using Java8☆17Updated 9 years ago
- Reactive Outlier Detection Engine☆12Updated 9 years ago
- VoltDB Click Stream Processing Example.☆16Updated 6 years ago
- A big data cluster management tool that creates and manages clusters of different technologies.☆21Updated 9 years ago
- A Storm based web crawler with Cassandra backend☆28Updated 11 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Open source analytics platform powered by Apache Cassandra, Spark, and Kafka☆34Updated 9 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Kaltura's next generation Analytics solution based on Spark, Cassandra and Kafka☆12Updated last year
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- A curated list of awesome Apache Spark packages and resources.☆40Updated 7 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Updated 7 years ago
- ☆11Updated 8 years ago
- ☆9Updated 9 years ago
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 3 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 8 years ago
- Python Implementation of Super and Hyper Log Log Sketches☆49Updated 12 years ago
- A real time streaming implementation of markov chain based fraud detection☆24Updated 9 years ago
- A template-based cluster provisioning system☆61Updated last year
- Easy distributed TensorFlow on Hadoop (moved to: hops-tensorflow)☆9Updated 7 years ago