max-webster / get-started-impala
This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)
☆22Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for get-started-impala
- Kite SDK Examples☆99Updated 3 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- ☆54Updated 10 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 5 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 9 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 5 years ago
- An example Apache Beam project.☆111Updated 7 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 7 years ago
- Notes about Spark Streaming in Apache Spark☆58Updated 7 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆34Updated 11 years ago
- Cascading on Apache Flink®☆54Updated 9 months ago
- Oozie Samples☆51Updated 10 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆108Updated 6 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- Utility to easily copy files into HDFS☆69Updated 4 years ago
- Code used in "Pro Spark Streaming: The Zen of Real-time Analytics using Apache Spark" published by Apress Publishing.☆48Updated 8 years ago
- Magic to help Spark pipelines upgrade☆34Updated last month
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- These are some code examples☆55Updated 4 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 10 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated 11 months ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- JSON schema parser for Apache Spark☆81Updated 2 years ago