gbraccialli / SparkUtils
☆11Updated 9 years ago
Alternatives and similar repositories for SparkUtils:
Users that are interested in SparkUtils are comparing it to the libraries listed below
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 8 years ago
- ☆8Updated 6 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- Prescriptive Applications over Kite and Hadoop☆12Updated 9 years ago
- Apache Pig plugin for Eclipse☆12Updated 8 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- ☆9Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- ☆11Updated 10 years ago
- HDFS Automatic Snapshot Service for Linux☆12Updated 8 years ago
- ☆11Updated 9 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- ☆10Updated 9 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Java code for Apache Nifi processors☆11Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- phData Pulse application log aggregation and monitoring☆13Updated 4 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- Examples of all Machine Learning Algorithm in Apache Spark☆15Updated 7 years ago
- SQL Windowing Functions for Hadoop☆65Updated 2 years ago
- Spark on Kudu up and running samples☆10Updated 8 years ago
- Ambari service to deploy/manage Hortonworks IoT demo☆22Updated 7 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- Cascading on Apache Flink®☆54Updated last year
- Hadoop Data Pipeline using Falcon☆15Updated 8 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago