prakashdontaraju / google-cloud-ecommerce
ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipeline ― Cloud Storage, Dataproc, PySpark, Cloud Spanner and Tableau
☆11Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for google-cloud-ecommerce
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆22Updated 2 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆19Updated 2 years ago
- End to end data engineering project☆51Updated 2 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Updated 2 years ago
- Repo for holding the dbt project used to make sense of cloud cost data from the major cloud platforms☆37Updated 4 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago
- Repository for Data Engineering Interview Series☆21Updated last month
- Example repo to create end to end tests for data pipeline.☆21Updated 5 months ago
- Common GitHub actions and workflows for maintaining dbt☆12Updated this week
- All the Snowflake Virtual Warehouse - Example☆11Updated 4 years ago
- Simple ETL pipeline using Python☆21Updated last year
- Example orchestration pipeline for Fivetran + dbt managed by Airflow☆21Updated 3 years ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆24Updated 3 years ago
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3☆26Updated 3 years ago
- F1 Data Pipeline☆23Updated last year
- Macros for generating dbt model data profiles☆81Updated last month
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- My applied big data analytic project with pyspark.☆10Updated 2 years ago
- dbt plugin for Palm CLI☆21Updated 8 months ago
- ☆11Updated 3 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆11Updated 2 years ago
- Simple stream processing pipeline☆92Updated 5 months ago
- ☆20Updated 8 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Data models for Hubspot built using dbt.☆33Updated last week
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆21Updated 2 years ago
- A custom end-to-end data pipeline for customer churn☆9Updated 3 weeks ago
- This repo contains commands that data engineers use in day to day work.☆59Updated last year