ThinkBigAnalytics / Hive-Extensions-from-Think-Big-Analytics
Reusable code for Hive
☆16 Updated 10 years ago
Alternatives and similar repositories for Hive-Extensions-from-Think-Big-Analytics:
Users who are interested in Hive-Extensions-from-Think-Big-Analytics are comparing it to the repositories listed below
- Cascading on Apache Flink® ☆54 Updated last year
- functionstest ☆33 Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 8 years ago
- Helpful user-defined functions / table-generating functions for Hive ☆101 Updated 8 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format. ☆34 Updated 11 years ago
- Simple Spark example of generating table stats for use in data quality checks ☆28 Updated 7 years ago
- Support Highcharts in Apache Zeppelin ☆81 Updated 7 years ago
- A utility for generating Oozie workflows from a YAML definition ☆48 Updated 6 years ago
- Recipes and examples for Apache Spark ☆13 Updated 10 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers. ☆72 Updated 8 years ago
- Kite SDK Examples ☆98 Updated 3 years ago
- Test your Hive scripts inside your favorite IDE with HiveQLUnit! Increase your developers' productivity by testing on all operating system… ☆39 Updated 4 years ago
- Hive + Avro. SerDe for working with Avro in Hive ☆59 Updated last year
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time-based directory paths ☆42 Updated 8 years ago
- A small project to show how to add lineage to Atlas when using Spark as an ETL tool ☆12 Updated 8 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A… ☆13 Updated 8 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents. ☆42 Updated last year
- Interactive Audience Analytics with Spark and HyperLogLog ☆55 Updated 9 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 Updated 7 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner) ☆88 Updated 8 years ago
- SQL for Kafka Connectors ☆98 Updated last year
- Ansible playbook for automated HDP 2.x deployment install with Kerberos ☆19 Updated 8 years ago
- Spark cloud integration: tests, cloud committers and more ☆19 Updated 2 months ago
- Example project to show how to use Spark to read and write Avro/Parquet files ☆50 Updated 11 years ago
- something to help you spark ☆65 Updated 6 years ago
- Random implementation notes ☆33 Updated 12 years ago
- Aerospike Spark Connector ☆35 Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 9 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media) ☆22 Updated 7 years ago
- ☆76 Updated 8 years ago