An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS application using Apache Airflow as an orchestration tool and various data warehouse technologies and finally using Apache Superset to connect to DWH for generating BI dashboards for weekly reports
☆25Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for Prescriber-ETL-data-pipeline
Users that are interested in Prescriber-ETL-data-pipeline are comparing it to the libraries listed below
Sorting:
- This is an analytical project done using python to process and extract valuable insights from WhatsApp text file, deployed as a webapp us…☆19Dec 8, 2023Updated 2 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23May 14, 2022Updated 3 years ago
- Backend: No more asking where my money goes.☆10Jan 4, 2023Updated 3 years ago
- ☆12Jul 27, 2021Updated 4 years ago
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆11Jan 24, 2021Updated 5 years ago
- The Missing check on Laravel Request Unicity.☆10Oct 23, 2023Updated 2 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆12Nov 6, 2023Updated 2 years ago
- Reusable OpenAI secure UI and infrastructure for AI Chat with Azure☆18Feb 16, 2026Updated 3 weeks ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆13Jun 26, 2022Updated 3 years ago
- BlaBlaConf's Demo to understand FastAPI, ReactJS, MongoDB ✨☆16Oct 29, 2021Updated 4 years ago
- a simple operating system, that will grow as I learn more osdev☆11Oct 24, 2023Updated 2 years ago
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore.☆17Apr 22, 2022Updated 3 years ago
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆11May 25, 2023Updated 2 years ago
- In this project first we fetch data of any stock(NSE) in realtime then we evaluate the stock price using basics visualizations then we…☆12Mar 24, 2023Updated 2 years ago
- Drag & drop UI to build your customized LLM flow☆13Updated this week
- In this tutorial, we have added step-by-step instructions to build your own AI chatbot with ChatGPT API. From setting up tools to install…☆11Apr 13, 2023Updated 2 years ago
- Implementations of transformer models in pytorch☆14Jun 2, 2020Updated 5 years ago
- ☆17Jan 12, 2026Updated last month
- Data wrangling and Feature analysis based on the Netflix userbase sample Dataset.☆14Dec 13, 2023Updated 2 years ago
- ☆11Jan 15, 2019Updated 7 years ago
- Complex Data Extraction with LLMs☆10Nov 4, 2024Updated last year
- ☆15May 7, 2025Updated 10 months ago
- Splunk Add-on for Microsoft Azure☆11Dec 15, 2025Updated 2 months ago
- Sample Drupal 8 site running on Kubernetes at Digital Ocean☆11Dec 10, 2022Updated 3 years ago
- ☆12Feb 6, 2023Updated 3 years ago
- Module for pipelines concept in PySpark☆16Mar 27, 2024Updated last year
- ☆11Jul 13, 2020Updated 5 years ago
- Examples for the Activate conference☆11Sep 11, 2019Updated 6 years ago
- National Stock Exchange (India) (nseindia.com) Web-Scraping For collecting data for real-time visualization and machine learning projects…☆16Aug 11, 2024Updated last year
- i tried to solve as many tasks as possible to make my SQL skills better☆14Apr 26, 2024Updated last year
- This repository contains tasks on how to build an ETL pipeline for the online transaction data of an e-commerce company.☆18Jun 27, 2023Updated 2 years ago
- ☆12May 19, 2021Updated 4 years ago
- ETL using Python in Jupyter Notebook, loading CSV, cleaning data, and saving to SQL Database.☆14Nov 17, 2020Updated 5 years ago
- Scipy main repository☆12May 10, 2024Updated last year
- In this repository, I have curated all the materials I used to study Machine learning Algorithms.☆18Updated this week
- The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-m…☆15Jul 9, 2021Updated 4 years ago
- Numpy main repository☆13Oct 22, 2025Updated 4 months ago
- List of interesting links about ML Algorithms, Data Science, Network Analysis, and others.☆12May 9, 2023Updated 2 years ago