tfayyaz / cloud-dataprocLinks
Cloud Dataproc: Samples and Utils
☆11Updated 5 years ago
Alternatives and similar repositories for cloud-dataproc
Users that are interested in cloud-dataproc are comparing it to the libraries listed below
Sorting:
- ☆130Updated last year
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆74Updated last year
- ☆42Updated 5 years ago
- Dataproc templates and pipelines for solving in-cloud data tasks☆137Updated this week
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk☆14Updated 4 years ago
- ☆144Updated 11 months ago
- ☆24Updated 2 years ago
- Source code for 'BigQuery for Data Warehousing' by Mark Mucchetti☆16Updated 5 years ago
- Sample code with integration between Data Catalog and RDBMS data sources.☆72Updated 3 years ago
- Interactive Notebooks that support the book☆40Updated 5 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆95Updated last year
- The go to demo for public and private dbt Learn☆80Updated 7 months ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- Repository for Google Cloud Run Deep Dive☆11Updated 5 years ago
- Data Engineering with Spark and Delta Lake☆104Updated 2 years ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.☆22Updated last year
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes☆21Updated 5 years ago
- Repository for Beam College sessions☆111Updated 4 years ago
- ☆36Updated 3 years ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆167Updated last week
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Delta Lake examples☆230Updated last year
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated 2 years ago
- AWS Quick Start Team☆19Updated last year
- Rules based grant management for Snowflake☆41Updated 6 years ago
- ☆80Updated last year
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆60Updated last week
- ☆90Updated 2 years ago
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Updated 6 years ago
- ☆104Updated 9 months ago