tfayyaz / cloud-dataprocLinks
Cloud Dataproc: Samples and Utils
☆11Updated 5 years ago
Alternatives and similar repositories for cloud-dataproc
Users that are interested in cloud-dataproc are comparing it to the libraries listed below
Sorting:
- ☆130Updated last year
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆76Updated last year
- Dataproc templates and pipelines for solving in-cloud data tasks☆148Updated last week
- ☆146Updated last year
- Repository for Beam College sessions☆112Updated 4 years ago
- Speech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create ana…☆73Updated last year
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk☆17Updated 4 years ago
- An end to end demo of Google's Cloud data and analytic stack.☆279Updated this week
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆170Updated 2 weeks ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆95Updated last year
- Repository for the dbt Semantic Layer course☆11Updated 2 months ago
- ☆42Updated 5 years ago
- A Streamlit app that provides insights on your Snowflake account usage.☆61Updated last year
- ☆16Updated last month
- markup to create labs for courses from the Google Cloud training catalog.☆49Updated 4 months ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 4 years ago
- DMT is an end to end automation of data warehouse migration, focused on extraction, SQL translation, data migration, data validation, etc…☆43Updated 3 weeks ago
- Building Big Data Pipelines with Apache Beam, published by Packt☆89Updated 2 years ago
- ☆11Updated 2 years ago
- Sample code with integration between Data Catalog and RDBMS data sources.☆72Updated 4 years ago
- PySpark data-pipeline testing and CICD☆28Updated 5 years ago
- The Dataflow Solution Guides offer full end-to-end deployment for the most common streaming solutions to run on Dataflow.☆41Updated this week
- Cloud Dataproc: Samples and Utils☆206Updated last month
- ☆72Updated this week
- ☆26Updated 3 months ago
- Data Catalog Tag Templates☆30Updated 8 months ago
- Spark in Action, 2nd edition - chapter 2☆29Updated 2 years ago
- BigTesty is a framework that allows to create Integration Tests with BigQuery on a real and short lived Infrastructure.☆67Updated last year
- Snowflake demo for Financial Services☆21Updated 9 months ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆61Updated last month