gbraccialli / SparkUtils
☆11 · Updated 9 years ago
Alternatives and similar repositories for SparkUtils:
Users who are interested in SparkUtils are comparing it to the libraries listed below.
- An Apache Spark app for moving data between Apache Hive and Apache Phoenix/HBase · ☆14 · Updated 9 years ago
- ☆11 · Updated 9 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files · ☆50 · Updated 11 years ago
- ☆10 · Updated 10 years ago
- Spark to Tableau Extractor library · ☆18 · Updated 7 years ago
- Prescriptive Applications over Kite and Hadoop · ☆12 · Updated 9 years ago
- Apache Flink as a Cloudera Manager Service · ☆12 · Updated 9 years ago
- Workshop for Hadoop Operations Best Practices · ☆10 · Updated 10 years ago
- ☆8 · Updated 7 years ago
- Code and Data Samples for Big Data Warehousing. · ☆10 · Updated 9 years ago
- ☆18 · Updated 9 years ago
- ☆11 · Updated 10 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark. · ☆16 · Updated 9 years ago
- ☆21 · Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines · ☆32 · Updated 8 years ago
- Reads an HBase table and writes it out as Text, Seq, Avro, or Parquet · ☆28 · Updated 10 years ago
- Apache Pig plugin for Eclipse · ☆12 · Updated 8 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark · ☆51 · Updated 11 years ago
- Examples for Fast Data Processing with Spark · ☆59 · Updated 11 years ago
- Sample custom Nifi processor to process tcpdump · ☆18 · Updated 9 years ago
- Sparking Using Java8 · ☆17 · Updated 10 years ago
- Cascading on Apache Flink® · ☆54 · Updated last year
- Flink Examples · ☆39 · Updated 9 years ago
- HDFS Automatic Snapshot Service for Linux · ☆12 · Updated 8 years ago
- Stratosphere is now Apache Flink. · ☆197 · Updated last year
- A small project to show how to add lineage to Atlas when using Spark as ETL tool · ☆12 · Updated 8 years ago
- Java code for Apache Nifi processors · ☆11 · Updated 7 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm · ☆21 · Updated 4 years ago
- ☆9 · Updated 9 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark · ☆41 · Updated 7 years ago