cjmatta / drill_ipython_notebookLinks

Drill demo using the iPython notebook

☆9

Alternatives and similar repositories for drill_ipython_notebook

Users that are interested in drill_ipython_notebook are comparing it to the libraries listed below

Sorting:

mapr-demos / drill-spot-price-history
☆7Updated 9 years ago
ojai / ojai
Core OJAI APIs
☆47Updated last year
mapr / mapr-docker-multi
☆16Updated 8 years ago
ShopRunner1 / stork
Make your libraries magically appear in Databricks.
☆47Updated last year
randerzander / jupyter-service
Ambari Service definition for an Jupyter (IPython3) Notebook service
☆42Updated 8 years ago
beljun / zeppelin-plotly
One way of using Plot.ly on Zeppelin notebooks
☆28Updated 9 years ago
holdenk / high-performance-spark-examples
Examples for High Performance Spark
☆15Updated 7 months ago
CODAIT / spark-db2
DB2/DashDB Connector for Apache Spark
☆14Updated 3 years ago
hkropp / vagrant-hdp
Beyond HDP Sandbox
☆18Updated 9 years ago
snowplow-archive / spark-streaming-example-project
A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB
☆94Updated 4 years ago
snowplow-archive / google-cloud-dataflow-example-project
Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow
☆30Updated 8 years ago
jbenninghoff / cluster-validation
Scripts to validate that a cluster is ready for MapR Data Platform installation
☆85Updated 5 years ago
claudiofahey / isilon-hadoop-tools
Tools to deploy Hadoop on EMC Isilon
☆17Updated 8 years ago
max-webster / get-started-impala
This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)
☆22Updated 7 years ago
SamHjelmfelt / Ember
Ambari and Cloudera Manager in Docker
☆22Updated 6 years ago
abajwa-hw / single-view-demo
Single view demo
☆14Updated 9 years ago
chennavarri / aws-lambda-pandas-sample
Required packages for using pandas in AWS Lambda functions
☆45Updated 8 years ago
memsql / streamliner-examples
Example code for building your own MemSQL Streamliner Pipelines
☆23Updated 8 years ago
FINRAOS / herd
Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…
☆135Updated 2 years ago
redsymbol / csv2parquet
Create Parquet files from CSV
☆68Updated 7 years ago
cloudera / director-scripts
Cloudera Director sample code
☆61Updated 5 years ago
onetapbeyond / lambda-spark-executor
Apache Spark AWS Lambda Executor (SAMBA)
☆44Updated 6 years ago
potix2 / spark-google-spreadsheets
Google Spreadsheets datasource for SparkSQL and DataFrames
☆57Updated last year
hortonworks / structor
Vagrant files creating multi-node virtual Hadoop clusters with or without security.
☆67Updated 5 years ago
bythebay / pipeline
Complete Pipeline Training at Big Data Scala By the Bay
☆71Updated 9 years ago
seanorama / masterclass
Materials for various Hadoop & Nifi related workshops
☆51Updated 6 years ago
tgrall / drill-workshop
Apache Drill Workshop
☆19Updated 9 years ago
atlassian / themis
Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)
☆47Updated last year
hellonarrativ / spectrify
Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.
☆117Updated 2 years ago
adobe-research / spark-cluster-deployment
Automates Spark standalone cluster tasks with Puppet and Fabric.
☆43Updated 10 years ago