cjmatta / drill_ipython_notebook
Drill demo using the IPython notebook
☆9Updated 9 years ago
Alternatives and similar repositories for drill_ipython_notebook:
Users who are interested in drill_ipython_notebook are comparing it to the repositories listed below.
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆46Updated last year
- Core OJAI APIs☆47Updated last year
- Materials for various Hadoop & Nifi related workshops☆52Updated 5 years ago
- Ambari Service definition for a Jupyter (IPython3) Notebook service☆42Updated 8 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 4 years ago
- Simplify getting Zeppelin up and running☆56Updated 8 years ago
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 4 years ago
- Make your libraries magically appear in Databricks.☆47Updated last year
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- A chef cookbook for deploying spark☆30Updated 11 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆135Updated 2 years ago
- Ambari Service definition for deploying R & RHadoop libraries☆18Updated 9 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Cloudera Director sample code☆61Updated 5 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Scripts to validate that a cluster is ready for MapR Data Platform installation☆85Updated 4 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Updated 7 years ago
- One way of using Plot.ly on Zeppelin notebooks☆28Updated 9 years ago
- HDF masterclass materials☆28Updated 8 years ago
- Examples for High Performance Spark☆15Updated 3 months ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆30Updated 7 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status☆35Updated 6 years ago
- kafka-connect-s3: Ingest data from Kafka to Object Stores (S3)☆95Updated 5 years ago
- Fork of Cloudera Impala separated from Hadoop☆42Updated 8 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 9 years ago
- Ambari service to deploy/manage Hortonworks IoT demo☆22Updated 7 years ago
- The Device Manager Demo is designed to demonstrate a fully functioning modern Data/IoT application. It is a Lambda architecture built usi…☆13Updated 7 years ago