tfayyaz / cloud-dataprocLinks
Cloud Dataproc: Samples and Utils
☆11Updated 4 years ago
Alternatives and similar repositories for cloud-dataproc
Users that are interested in cloud-dataproc are comparing it to the libraries listed below
Sorting:
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆69Updated last year
- ☆128Updated last year
- ☆11Updated last year
- ☆137Updated 7 months ago
- Sample code with integration between Data Catalog and Hive data source.☆24Updated 4 months ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Dataproc templates and pipelines for solving in-cloud data tasks☆129Updated this week
- Data Quality Engine for BigQuery☆275Updated last month
- PySpark data-pipeline testing and CICD☆28Updated 4 years ago
- ☆36Updated 3 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated 2 years ago
- markup to create labs for courses from the Google Cloud training catalog.☆49Updated 3 years ago
- ☆80Updated 8 months ago
- ☆62Updated last week
- Sample code with integration between Data Catalog and RDBMS data sources.☆72Updated 3 years ago
- Cloud Dataproc: Samples and Utils☆203Updated last week
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆83Updated last year
- ☆89Updated 5 months ago
- Data Catalog Tag Templates☆30Updated last month
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 5 months ago
- ☆42Updated 5 years ago
- Repository for TUTI (Template for UCAIP Training and Inference)☆16Updated 3 years ago
- ☆17Updated 2 years ago
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- ☆20Updated 5 years ago
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 3 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆69Updated 2 months ago
- ☆17Updated 10 months ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- ☆86Updated 2 years ago