tfayyaz / cloud-dataproc
Cloud Dataproc: Samples and Utils
☆11 Updated 4 years ago
Alternatives and similar repositories for cloud-dataproc
Users who are interested in cloud-dataproc are comparing it to the libraries listed below.
- ☆129 Updated last year
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service ☆71 Updated last year
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk ☆14 Updated 3 years ago
- ☆139 Updated 7 months ago
- Dataproc templates and pipelines for solving in-cloud data tasks ☆131 Updated last week
- Repository for Beam College sessions ☆109 Updated 4 years ago
- Building Big Data Pipelines with Apache Beam, published by Packt ☆86 Updated 2 years ago
- Sample code with integration between Data Catalog and RDBMS data sources. ☆71 Updated 3 years ago
- ☆91 Updated 6 months ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo… ☆83 Updated last year
- ☆42 Updated 5 years ago
- An end to end demo of Google's Cloud data and analytic stack. ☆259 Updated last week
- Cloud-native, data onboarding architecture for Google Cloud Datasets ☆160 Updated 5 months ago
- CICD pipeline that deploys a dbt image on a GKE cluster ☆11 Updated 4 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP ☆93 Updated 11 months ago
- Sample code with integration between Data Catalog and Hive data source. ☆24 Updated 5 months ago
- Interactive Notebooks that support the book ☆40 Updated 4 years ago
- ☆63 Updated 3 weeks ago
- The Dataflow Solution Guides offer full end-to-end deployment for the most common streaming solutions to run on Dataflow. ☆35 Updated last month
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆44 Updated 2 years ago
- ☆11 Updated last year
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt ☆16 Updated 2 years ago
- ☆46 Updated last year
- The go to demo for public and private dbt Learn ☆79 Updated 3 months ago
- Cloud Dataproc: Samples and Utils ☆203 Updated last month
- Data Engineering with Spark and Delta Lake ☆101 Updated 2 years ago
- Data Catalog Tag Templates ☆30 Updated 2 months ago
- Speech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create ana… ☆72 Updated last year
- Cloned by the `dbt init` task ☆60 Updated last year