minhky2185 / healthcare_data_pipeline
An end-to-end data pipeline for building a data lake and supporting reporting using Apache Spark.
☆10 Updated last year
Alternatives and similar repositories for healthcare_data_pipeline:
Users who are interested in healthcare_data_pipeline are comparing it to the repositories listed below.
- Resources and projects from the Udacity Data Engineering with AWS Nanodegree program ☆24 Updated last year
- This repository contains the code for a real-time election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr… ☆34 Updated last year
- YouTube tutorial project ☆97 Updated last year
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB. ☆39 Updated last year
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc… ☆21 Updated 2 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro… ☆21 Updated last year
- In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our … ☆24 Updated last year
- ☆31 Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆61 Updated 7 months ago
- This repo contains my projects from the Udacity Data Engineering Nanodegree ☆13 Updated last year
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3 ☆29 Updated 3 years ago
- This repo contains all code and data for WWCode Python DE workshop Aug 18 and 25 2022 ☆24 Updated 2 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en… ☆16 Updated 11 months ago
- ☆19 Updated last year
- ☆61 Updated last week
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres, and Docker. ☆76 Updated 5 months ago
- ☆40 Updated 6 months ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like… ☆93 Updated last year
- End to end data engineering project ☆53 Updated 2 years ago
- 😈 Complete End to End ETL Pipeline with Spark, Airflow, & AWS ☆43 Updated 5 years ago
- This project aims to leverage Amazon Web Services to create a trending YouTube videos analytics service. Project contains different data en… ☆13 Updated this week
- Data Engineering YouTube Analysis Project by Darshil Parmar ☆168 Updated last year
- Example repo to create end to end tests for data pipeline. ☆21 Updated 7 months ago
- These are projects for healthcare analytics. The projects are based on open data on health care. ☆10 Updated 4 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath… ☆21 Updated last year
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti… ☆29 Updated last year
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin… ☆14 Updated 3 years ago
- End-to-end data platform leveraging the Modern data stack ☆43 Updated 9 months ago
- ☆129 Updated last year
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr… ☆10 Updated last year