DhruvKumar / spark-workshop
☆10Updated 10 years ago
Alternatives and similar repositories for spark-workshop:
Users that are interested in spark-workshop are comparing it to the libraries listed below
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- Single view demo☆14Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- ☆21Updated 9 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.☆29Updated 4 years ago
- ☆14Updated 8 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 9 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- POC: Spark consumer for bottledwater-pg Kafka Avro topics☆16Updated 4 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- ☆26Updated 5 years ago
- Random implementation notes☆33Updated 11 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)☆35Updated 10 years ago
- Recipes and examples for Apache Spark☆13Updated 10 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Updated 9 years ago
- Cascading on Apache Flink®☆54Updated last year
- An application to monitor and drive the Spark JobServer☆11Updated 10 years ago
- Ambari Service definition for deploying R & RHadoop libraries☆18Updated 9 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Java code for Apache Nifi processors☆11Updated 7 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- A simple Twitter-Streaming Application for Apache Flink☆21Updated 9 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- functionstest☆33Updated 8 years ago