tfayyaz / cloud-dataproc
Cloud Dataproc: Samples and Utils
☆11 Updated 4 years ago
Related projects
Alternatives and complementary repositories for cloud-dataproc
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service ☆63 Updated 6 months ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks ☆119 Updated this week
- Data Catalog Tag Templates ☆29 Updated last month
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo… ☆77 Updated last year
- CICD pipeline that deploys a dbt image on a GKE cluster ☆11 Updated 3 years ago
- Data Warehousing Made Easy with Google BigQuery and Apache Airflow ☆19 Updated 5 years ago
- Sample Airflow DAGs ☆61 Updated 2 years ago
- ☆127 Updated this week
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆25 Updated last year
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes ☆20 Updated 4 years ago
- ☆127 Updated 6 months ago
- Weekly Data Engineering Newsletter ☆93 Updated 4 months ago
- ☆66 Updated last month
- Delta Lake Documentation ☆46 Updated 5 months ago
- ☆57 Updated 3 weeks ago
- How to Automate SQL: dbt (data build tool) tutorial on bigquery with extensive NOTES ☆31 Updated last year
- ☆32 Updated 7 months ago
- PySpark data-pipeline testing and CICD ☆28 Updated 4 years ago
- Deploys a secured BigQuery data warehouse ☆77 Updated 3 months ago
- Keep your local python scripts installed and in sync with a databricks notebook. Shortens the feedback loop to develop projects using a h… ☆15 Updated this week
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk ☆13 Updated 3 years ago
- Interactive Notebooks that support the book ☆38 Updated 4 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆36 Updated 4 months ago
- ☆42 Updated 4 years ago
- Rules based grant management for Snowflake ☆40 Updated 5 years ago
- Data lake, data warehouse on GCP ☆54 Updated 2 years ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets ☆151 Updated this week
- ☆11 Updated 11 months ago
- A tutorial for the Great Expectations library. ☆68 Updated 3 years ago