judeleonard / Prescriber-ETL-data-pipelineView external linksLinks
An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS application using Apache Airflow as an orchestration tool and various data warehouse technologies and finally using Apache Superset to connect to DWH for generating BI dashboards for weekly reports
☆25Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for Prescriber-ETL-data-pipeline
Users that are interested in Prescriber-ETL-data-pipeline are comparing it to the libraries listed below
Sorting:
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- Backend: No more asking where my money goes.☆10Jan 4, 2023Updated 3 years ago
- The Missing check on Laravel Request Unicity.☆10Oct 23, 2023Updated 2 years ago
- Reusable OpenAI secure UI and infrastructure for AI Chat with Azure☆18Aug 4, 2025Updated 6 months ago
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆11Jan 24, 2021Updated 5 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆11Nov 6, 2023Updated 2 years ago
- A Machine Learning project for Machine Learning Internship offered by InternshipStudio.☆12Aug 8, 2021Updated 4 years ago
- BlaBlaConf's Demo to understand FastAPI, ReactJS, MongoDB ✨☆16Oct 29, 2021Updated 4 years ago
- PyTorch Global Summer Hackathon 2020 Second Prize Winner in Mobile/Web Application Category☆10Aug 24, 2025Updated 5 months ago
- Forecasting Netflix Customer Retention based on Gaussian Process Regression☆14Jul 22, 2023Updated 2 years ago
- Drag & drop UI to build your customized LLM flow☆13Updated this week
- In this project, we have to create a predictive model which allows the company to maximize the profit of the next marketing campaign☆12Oct 18, 2025Updated 3 months ago
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore.☆17Apr 22, 2022Updated 3 years ago
- Data Structures in Python☆10Jan 19, 2026Updated 3 weeks ago
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆11May 25, 2023Updated 2 years ago
- In this tutorial, we have added step-by-step instructions to build your own AI chatbot with ChatGPT API. From setting up tools to install…☆11Apr 13, 2023Updated 2 years ago
- Claude Code Review Skill☆27Jan 9, 2026Updated last month
- This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)☆11Apr 29, 2022Updated 3 years ago
- Sample Drupal 8 site running on Kubernetes at Digital Ocean☆11Dec 10, 2022Updated 3 years ago
- the full pipeline for model retraining with fastapi and github actions☆16Jul 5, 2024Updated last year
- Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with B…☆50Oct 13, 2024Updated last year
- ☆13Jul 15, 2023Updated 2 years ago
- This project promulgates an automated end-to-end ML pipeline that trains a biLSTM network for sentiment analysis, experiment tracking, be…☆15Feb 1, 2023Updated 3 years ago
- ☆12Dec 28, 2025Updated last month
- Scipy main repository☆12May 10, 2024Updated last year
- The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-m…☆14Jul 9, 2021Updated 4 years ago
- A travel app using native device features, React Navigation, Redux and Redux-Thunk.☆15Feb 16, 2023Updated 3 years ago
- Numpy main repository☆13Oct 22, 2025Updated 3 months ago
- Machine Learning Course☆17Mar 29, 2025Updated 10 months ago
- In this project, we will be deploying a Kubernetes cluster using a Jenkins CI/CD pipeline. We will be utilizing various DevOps tools such…☆14Jun 6, 2023Updated 2 years ago
- A Next js boilerplate for authentication☆18Mar 31, 2021Updated 4 years ago
- XCloud Project's objective is to build similar infrastructure on both AWS and GCP to have multi cloud infrastructure for an organization …☆13Aug 1, 2022Updated 3 years ago
- ☆13Feb 18, 2022Updated 3 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆50Aug 23, 2019Updated 6 years ago
- ☆14Mar 11, 2023Updated 2 years ago
- ☆16Sep 24, 2023Updated 2 years ago
- A series of experiments to learn about microservices☆11Sep 10, 2022Updated 3 years ago
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated last year
- It is a project that aim to detect and classify hate speech and offensive speech on Twitter using bag of words model. The ipython noteboo…☆12Oct 29, 2018Updated 7 years ago