GoogleCloudPlatform / dataproc-scala-examplesLinks
Dataproc Scala Examples is an effort to assist in the creation of Spark jobs written in Scala to run on Dataproc.
☆12Updated last year
Alternatives and similar repositories for dataproc-scala-examples
Users that are interested in dataproc-scala-examples are comparing it to the libraries listed below
Sorting:
- ☆21Updated 2 years ago
- Dataproc templates and pipelines for solving in-cloud data tasks☆148Updated this week
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆172Updated last month
- Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course☆751Updated 3 weeks ago
- This project leverages GCS, Composer, Dataflow, BigQuery, and Looker on Google Cloud Platform (GCP) to build a robust data engineering so…☆33Updated 2 years ago
- ☆179Updated 5 months ago
- Practice your Pyspark skills!☆100Updated 4 years ago
- ☆37Updated last month
- An end to end demo of Google's Cloud data and analytic stack.☆279Updated last week
- ☆52Updated last month
- Apartments Data Pipeline using Airflow and Spark.☆23Updated 3 years ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆585Updated last month
- This repository goes over how to handle massive variety in data engineering☆311Updated 3 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆284Updated last year
- Code snippets for Data Engineering Design Patterns book☆331Updated last month
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Updated 5 years ago
- Docker with Airflow and Spark standalone cluster☆262Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆131Updated last year
- Code for "Efficient Data Processing in Spark" Course☆361Updated 3 months ago
- Data Engineering on Google Cloud Platform☆380Updated last year
- Project utilising data from the Age of Empires api at 'https://aoestats.io'☆55Updated last year
- Execute DBT core on cloud run☆21Updated last year
- Supercharge BigQuery with BigFunctions☆759Updated 3 months ago
- A self-contained dbt project for testing purposes☆517Updated last year
- ☆10Updated 3 years ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆420Updated last week
- ☆146Updated last year
- Near real time ETL to populate a dashboard.☆73Updated 5 months ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆830Updated 3 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆245Updated 3 years ago