Automattic / cm-livy-scripts
Scripts for building Cloudera Manager parcel and CSD for Livy Spark Server
☆21Updated 7 years ago
Related projects
Alternatives and complementary repositories for cm-livy-scripts
- Helpful user-defined functions / table-generating functions for Hive☆101Updated 8 years ago
- Lightweight Azkaban client☆77Updated 4 years ago
- Example to show how to stop the Spark Streaming Application Gracefully☆26Updated 7 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆51Updated 8 years ago
- Sample UDFs and UDAs for Impala.☆64Updated 4 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 7 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Kafka as Hive Storage☆67Updated 10 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 5 years ago
- Reads an HBase table and writes it out as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- ElasticSearch integration for Apache Spark☆47Updated 8 years ago
- Sample Spark Streaming application for secure consumption from Kafka☆33Updated 7 years ago
- ☆54Updated 10 years ago
- Docker image for Apache Hive running on Tez☆25Updated 9 years ago
- Ambari service for Apache Zeppelin notebook☆71Updated 7 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- ☆27Updated 3 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 7 years ago
- Ambari Service definition for a Jupyter (IPython3) Notebook service☆42Updated 8 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- GeoIP Functions for hive☆48Updated 4 years ago
- CSD for Apache Airflow☆20Updated 5 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆75Updated 5 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆34Updated 11 years ago
- A collection of Hive UDFs☆75Updated 4 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆158Updated 2 years ago