vishal-bulbule / etl-pipeline-datafusion-airflow
This repository contains code and configuration files for an Extract, Transform, Load (ETL) project using Google Cloud Data Fusion for data extraction, Apache Airflow/Composer for orchestration, and Google BigQuery for data loading.
☆12Updated 11 months ago
Alternatives and similar repositories for etl-pipeline-datafusion-airflow:
Users that are interested in etl-pipeline-datafusion-airflow are comparing it to the libraries listed below
- Demo Codes will be shared here☆44Updated 3 months ago
- ☆15Updated 10 months ago
- ☆135Updated 2 years ago
- YouTube tutorial project☆98Updated last year
- Data Engineering with Google Cloud Platform, published by Packt☆113Updated last year
- Data Engineering YouTube Analysis Project by Darshil Parmar☆179Updated last year
- This project contain build end-to-end e-commerce data from data source into data warehouse and visualization.☆11Updated 5 months ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- apache-spark-with-databricks-for-data-engineering☆70Updated 7 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆117Updated last year
- ☆16Updated last year
- ☆22Updated 3 years ago
- tokyo-olympic-azure-data-engineering-project☆185Updated 7 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆79Updated 6 months ago
- Public data and analytics for our open course☆31Updated 10 months ago
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…☆67Updated last year
- ☆19Updated last year
- Uber Data Engineering Pipeline using Mage AI and BigQuery☆19Updated 6 months ago
- ☆149Updated 2 years ago
- This project leverages GCS, Composer, Dataflow, BigQuery, and Looker on Google Cloud Platform (GCP) to build a robust data engineering so…☆22Updated last year
- Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23☆13Updated last year
- Azure Data Factory☆57Updated this week
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt☆30Updated 9 months ago
- This repo contains all the code used in the Python for Data Engineering Course☆252Updated 9 months ago
- ☆18Updated 7 months ago
- ☆10Updated 10 months ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆52Updated last month
- Sample repo for startdataengineering DE 101 free course☆48Updated 7 months ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆46Updated 2 years ago