GoogleCloudDataproc / jupyterhub-dataprocspawner
☆14 Updated 3 years ago
Alternatives and similar repositories for jupyterhub-dataprocspawner
Users that are interested in jupyterhub-dataprocspawner are comparing it to the libraries listed below
- Mirror of Apache livy (Incubating) ☆13 Updated last year
- Oozie Workflow to Airflow DAGs migration tool ☆87 Updated 3 months ago
- Apache Beam Site ☆29 Updated this week
- Hive Storage Handler for interoperability between BigQuery and Apache Hive ☆19 Updated 4 months ago
- Sample code with integration between Data Catalog and Hive data source. ☆24 Updated 4 months ago
- ☕⛵WIP PySpark dependency management ☆22 Updated 6 years ago
- ☆54 Updated 7 years ago
- An example project for doing grid search in MLlib ☆13 Updated 10 years ago
- Uses Cloud Build to deploy a scalable batch ingestion pipeline consisting of GCS, Cloud Functions, Dataflow and BigQuery ☆22 Updated 2 years ago
- Tools for creating Dataproc custom images ☆34 Updated last month
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor… ☆69 Updated 4 months ago
- Cask Hydrator Plugins Repository ☆68 Updated 2 weeks ago
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali… ☆13 Updated 3 years ago
- Snippets of code used in blog posts and other media. ☆13 Updated 2 months ago
- Cloud Spanner Connector for Apache Spark ☆17 Updated 5 months ago
- Apache Fluo Muchos ☆26 Updated 6 months ago
- CDAP Kubernetes Operator ☆19 Updated 2 months ago
- Sample code with integration between Data Catalog and BI data sources. ☆32 Updated 3 years ago
- ☆66 Updated 10 months ago
- ☆14 Updated last year
- Commons code used by the Data Catalog connectors, and links for the connectors sample code. ☆61 Updated 3 years ago
- Spark pipelines that correspond to a series of Dataflow examples. ☆27 Updated 6 years ago
- Open source tools for Google Cloud Storage and Databases. ☆63 Updated last year
- Mirror of Apache sdap (Incubating) ☆11 Updated last year
- ☆37 Updated 6 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. This re… ☆167 Updated 6 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool ☆12 Updated 8 years ago
- [DEPRECATED] GAE python based app which regularly collects information about GCP resources and stores them in BigQuery ☆45 Updated last year
- Test your Hive scripts inside your favorite IDE with HiveQLUnit! Increase your developers productivity by testing on all operating system… ☆39 Updated 4 years ago
- Docker Compose files to compose a NiFi cluster on Docker. ☆35 Updated 8 years ago