prakashdontaraju / google-cloud-ecommerce
ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipeline ― Cloud Storage, Dataproc, PySpark, Cloud Spanner and Tableau
☆11 Updated 3 years ago
Alternatives and similar repositories for google-cloud-ecommerce
Users that are interested in google-cloud-ecommerce are comparing it to the libraries listed below
- My applied big data analytic project with pyspark. ☆10 Updated 2 years ago
- All the Snowflake Virtual Warehouse - Example ☆12 Updated 4 years ago
- Cloned by the `dbt init` task ☆61 Updated last year
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc… ☆22 Updated 3 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit ☆19 Updated 2 years ago
- ☆14 Updated 6 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce ☆13 Updated last year
- ☆28 Updated 7 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database ☆13 Updated 3 years ago
- ☆21 Updated last year
- A data engineering project with Airflow, dbt, Terraform, GCP and much more! ☆24 Updated 2 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on… ☆27 Updated 2 years ago
- Repo for holding the dbt project used to make sense of cloud cost data from the major cloud platforms ☆37 Updated 5 years ago
- A streaming ETL pipeline for Realtime Tweet Collection, Analysis and Reporting ☆9 Updated 3 years ago
- ☆87 Updated 2 years ago
- A curated list of awesome Snowflake analytic data warehouse learning resources ☆20 Updated 4 years ago
- dbt-github-workflow is a boilerplate that contains all the necessary configurations to set up a simple CI/CD pipeline for your data model… ☆17 Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 Updated 9 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆55 Updated 2 years ago
- Simple stream processing pipeline ☆102 Updated 11 months ago
- End to end data engineering project ☆54 Updated 2 years ago
- Simple ETL pipeline using Python ☆26 Updated last year
- This repo guides you step by step through creating a star schema dimensional model. ☆25 Updated 3 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow ☆33 Updated 4 years ago
- This repo contains commands that data engineers use in day to day work. ☆60 Updated 2 years ago
- ☆23 Updated 2 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS) ☆16 Updated 6 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆27 Updated 2 years ago
- The dbt adapter for Firebolt ☆29 Updated 4 months ago
- dbt plugin for Palm CLI ☆21 Updated last year