gbraccialli / SparkUtils
☆11Updated 9 years ago
Alternatives and similar repositories for SparkUtils
Users who are interested in SparkUtils are comparing it to the repositories listed below
- An Apache Spark app for moving data between Apache Hive and Apache Phoenix/HBase☆14Updated 9 years ago
- ☆10Updated 10 years ago
- Prescriptive Applications over Kite and Hadoop☆12Updated 9 years ago
- ☆18Updated 9 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- ☆8Updated 7 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- ☆11Updated 10 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- Reads an HBase table and writes the output as Text, Seq, Avro, or Parquet☆28Updated 11 years ago
- ☆9Updated 9 years ago
- Flink Examples☆39Updated 9 years ago
- ☆21Updated 9 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- ☆11Updated 9 years ago
- A real-time streaming implementation of Markov chain-based fraud detection☆23Updated 10 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- ☆21Updated 8 years ago
- Apache Flink as a Cloudera Manager Service☆12Updated 9 years ago
- ☆9Updated 9 years ago
- HDFS Automatic Snapshot Service for Linux☆12Updated 8 years ago
- phData Pulse application log aggregation and monitoring☆13Updated 5 years ago
- Ambari service to deploy/manage Hortonworks IoT demo☆22Updated 8 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- Repo with sources for Spark blog posts and learning experiments in Spark☆14Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago