CeON / spark-utilsLinks
Practical utilities for spark applications
☆11Updated 2 years ago
Alternatives and similar repositories for spark-utils
Users that are interested in spark-utils are comparing it to the libraries listed below
Sorting:
- Simple Spark app that reads and writes Avro data☆31Updated 10 years ago
- Demo quering counts of a event stream with Apache Flink☆23Updated 7 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 12 years ago
- Simple Lambda Architecture implementation based on Apache Spark (Core, SQL, Streaming)☆40Updated 8 years ago
- Source code for Flink in Action☆31Updated 9 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 10 years ago
- Notes about Spark Streaming in Apache Spark☆60Updated 8 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆50Updated 11 years ago
- A tutorial on Apache Spark Unit Testing☆37Updated 10 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 10 years ago
- ☆39Updated 6 years ago
- SequenceIQ Hadoop examples☆115Updated 10 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Updated 8 years ago
- Getting started with Spark, Spark Streaming, Spark SQL, DataFrame☆36Updated 9 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 11 years ago
- Helpful user defined fuctions / table generating functions for Hive☆101Updated 9 years ago
- ☆48Updated 8 years ago
- Will come later...☆20Updated 3 years ago
- Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.☆115Updated 10 years ago
- Simple Spark Application☆76Updated 2 years ago
- Code used in "Pro Spark Streaming: The Zen of Real-time Analytics using Apache Spark" published by Apress Publishing.☆48Updated 9 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Updated 8 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆63Updated 2 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 7 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆55Updated 10 years ago
- Fast JVM collection☆60Updated 10 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆48Updated 10 years ago
- Fluent client for interacting with Spark Standalone Mode's Rest API for submitting, killing and monitoring the state of jobs.☆112Updated 7 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆46Updated 6 years ago