tgrall / drill-workshop
Apache Drill Workshop
☆19Updated 8 years ago
Alternatives and similar repositories for drill-workshop:
Users that are interested in drill-workshop are comparing it to the libraries listed below
- Scripts to validate that a cluster is ready for MapR Data Platform installation☆85Updated 4 years ago
- Drill demo using the iPython notebook☆9Updated 9 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- Support Highcharts in Apache Zeppelin☆81Updated 7 years ago
- Ambari Service definition for an Jupyter (IPython3) Notebook service☆42Updated 8 years ago
- Core OJAI APIs☆47Updated last year
- Materials for various Hadoop & Nifi related workshops☆52Updated 5 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆257Updated last year
- ☆54Updated 10 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 5 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 4 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- Remedy small files by combining them into larger ones.☆193Updated 2 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆47Updated 5 years ago
- Ambari service for Apache Zeppelin notebook☆71Updated 7 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 8 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Updated 7 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆135Updated 2 years ago
- ☆70Updated 2 years ago
- Kite SDK Examples☆99Updated 3 years ago
- Navigator SDK☆22Updated 6 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆283Updated 6 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆241Updated 9 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆96Updated 5 years ago
- DB2/DashDB Connector for Apache Spark☆14Updated 3 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 8 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 8 years ago
- Netezza Connector for Apache Spark☆13Updated 6 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 5 years ago