vishal-bulbule / etl-pipeline-datafusion-airflow
This repository contains code and configuration files for an Extract, Transform, Load (ETL) project using Google Cloud Data Fusion for data extraction, Apache Airflow/Composer for orchestration, and Google BigQuery for data loading.
☆13 · Updated last year
Alternatives and similar repositories for etl-pipeline-datafusion-airflow:
Users who are interested in etl-pipeline-datafusion-airflow are comparing it to the repositories listed below:
- Demo Codes will be shared here ☆46 · Updated 5 months ago
- Data Engineering with Google Cloud Platform, published by Packt ☆116 · Updated last year
- ☆13 · Updated 11 months ago
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt ☆36 · Updated 11 months ago
- Data Engineering YouTube Analysis Project by Darshil Parmar ☆190 · Updated last year
- ☆18 · Updated last year
- YouTube tutorial project ☆101 · Updated last year
- ☆22 · Updated 3 years ago
- ☆136 · Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆121 · Updated 11 months ago
- ☆19 · Updated last year
- tokyo-olympic-azure-data-engineering-project ☆199 · Updated 9 months ago
- Azure Data Factory ☆61 · Updated 3 weeks ago
- This project builds an end-to-end e-commerce data pipeline, from data source to data warehouse and visualization. ☆12 · Updated 7 months ago
- ☆16 · Updated last year
- This repo contains all the code used in the Python for Data Engineering Course ☆281 · Updated last year
- ☆50 · Updated last year
- ☆195 · Updated last year
- Data Engineering on GCP ☆35 · Updated 2 years ago
- Uber Data Engineering Pipeline using Mage AI and BigQuery ☆20 · Updated 9 months ago
- Git Repository ☆140 · Updated 2 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh… ☆129 · Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packt ☆138 · Updated last year
- Learn PySpark from Basics to Advanced. Check out the YouTube Series: [PySpark - Zero to Hero] ☆53 · Updated 3 months ago
- Realtime Data Engineering Project ☆28 · Updated 3 months ago
- Data Engineering with Databricks Cookbook, published by Packt ☆80 · Updated 10 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA… ☆34 · Updated last year
- ☆143 · Updated 11 months ago
- ☆12 · Updated last year
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS ☆45 · Updated 5 years ago