GoogleCloudDataproc / jupyterhub-dataprocspawner
☆14 · Updated 2 years ago
Alternatives and similar repositories for jupyterhub-dataprocspawner:
Users who are interested in jupyterhub-dataprocspawner are comparing it to the repositories listed below.
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration. ☆70 · Updated last year
- Simple Spark example of generating table stats for use of data quality checks ☆28 · Updated 7 years ago
- Cloud Spanner Connector for Apache Spark ☆17 · Updated last month
- Hive Storage Handler for interoperability between BigQuery and Apache Hive ☆19 · Updated 2 weeks ago
- Tools for creating Dataproc custom images ☆32 · Updated 2 weeks ago
- Sample code with integration between Data Catalog and Hive data source. ☆25 · Updated 2 weeks ago
- Reproducing Distributed Systems and Experiments on Cloud ☆39 · Updated last year
- Spark pipelines that correspond to a series of Dataflow examples. ☆27 · Updated 5 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide) ☆17 · Updated 7 years ago
- Cask Hydrator Plugins Repository ☆67 · Updated this week
- Ansible playbooks for Apache Spark on kube ☆27 · Updated 7 years ago
- ☆37 · Updated 5 years ago
- Magic to help Spark pipelines upgrade ☆34 · Updated 4 months ago
- ☆54 · Updated 7 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project. ☆30 · Updated 9 years ago
- Example project showing how to use Hive UDFs in Apache Spark ☆55 · Updated 5 years ago
- An example Apache Beam project. ☆111 · Updated 7 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 · Updated 8 years ago
- A library for exporting Spark ML models and pipelines to PFA ☆54 · Updated 6 years ago
- Sample programs for MapR Streams compatible with Apache Kafka 0.9 API ☆15 · Updated last year
- Example for experimenting with how JupyterHub can be configured to work with Kerberos ☆33 · Updated 7 years ago
- Ephemeral Hadoop clusters using Google Compute Platform ☆135 · Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples ☆70 · Updated 4 years ago
- Ambari Service definition for a Jupyter (IPython3) Notebook service ☆42 · Updated 8 years ago
- An example PySpark project with pytest ☆17 · Updated 7 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL ☆90 · Updated last year
- HDF masterclass materials ☆28 · Updated 8 years ago
- Livy Manager - Web UI for Managing Apache Livy Sessions ☆16 · Updated 7 years ago
- Modeling Lifecycle with ACME Occupancy Detection and Cloudera ☆14 · Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 · Updated last year