gbraccialli / SparkUtils
☆11Updated 9 years ago
Alternatives and similar repositories for SparkUtils:
Users that are interested in SparkUtils are comparing it to the libraries listed below
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 9 years ago
- ☆10Updated 10 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- ☆18Updated 8 years ago
- Prescriptive Applications over Kite and Hadoop☆12Updated 9 years ago
- ☆11Updated 9 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- Stratosphere is now Apache Flink.☆197Updated last year
- ☆8Updated 7 years ago
- Apache Pig plugin for Eclipse☆12Updated 8 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Java code for Apache Nifi processors☆11Updated 7 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- Apache Flink as a Cloudera Manager Service☆12Updated 8 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- Flink Examples☆39Updated 8 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- ☆21Updated 9 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- ☆9Updated 9 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- functionstest☆33Updated 8 years ago
- Cascading on Apache Flink®☆54Updated last year
- A real time streaming implementation of markov chain based fraud detection☆23Updated 10 years ago