Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3
☆32 · Feb 2, 2021 · Updated 5 years ago
Alternatives and similar repositories for Data_Engineering_Project_Portfolio
Users who are interested in Data_Engineering_Project_Portfolio compare it to the repositories listed below
- A data engineering project with Airflow, dbt, Terraform, GCP and much more! ☆26 · Nov 8, 2022 · Updated 3 years ago
- A project portfolio to accompany my resume ☆30 · Sep 5, 2023 · Updated 2 years ago
- Portfolio of projects and studies conducted in data engineering. ☆34 · Feb 22, 2025 · Updated last year
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database ☆14 · Oct 26, 2021 · Updated 4 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR ☆90 · Jul 17, 2019 · Updated 6 years ago
- This repo contains commands that data engineers use in day to day work. ☆61 · Feb 4, 2023 · Updated 3 years ago
- Airflow ETL for Meetup API ☆45 · Dec 27, 2018 · Updated 7 years ago
- This is a comprehensive end-to-end data engineering project. I extracted data directly from YouTube in raw JSON format using Python and A… ☆11 · Jun 4, 2024 · Updated last year
- The goal of this project is to illustrate Extract Transform Load (ETL) using Python and SQL. ETL is a process commonly done in computing,… ☆32 · Sep 7, 2021 · Updated 4 years ago
- This repository contains the "RFM Analysis" for a Sales Data of a Retailer in SQL. This is part of my Data Science Portfolio Projects ☆10 · Jul 3, 2023 · Updated 2 years ago
- Codebasics Resume Project Challenge, Provide Insights to Revenue Team in Hospitality Domain ☆10 · Sep 27, 2022 · Updated 3 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin… ☆11 · Mar 9, 2022 · Updated 3 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ☆13 · Jun 26, 2022 · Updated 3 years ago
- This project provides Inventory Management using Power BI, extremely useful for Warehouse/ In-plant Inventory Managers to effectively con… ☆13 · Feb 18, 2024 · Updated 2 years ago
- data engineering 100 days 🤖 🧲 🦾 | #DE ☆39 · Sep 15, 2023 · Updated 2 years ago
- Data encoding library for Haskell. ☆12 · Aug 4, 2023 · Updated 2 years ago
- Sticky Headers package for Framer X ☆11 · Jul 9, 2020 · Updated 5 years ago
- Udacity Data Engineering Nanodegree Project 3 ☆12 · Jul 14, 2019 · Updated 6 years ago
- Headless commerce template based on T3 stack ☆11 · Apr 6, 2023 · Updated 2 years ago
- ☆10 · May 24, 2021 · Updated 4 years ago
- Code for How to Add Product Reviews to Your Medusa Server and Next.js Storefront ☆11 · Mar 8, 2023 · Updated 2 years ago
- ☆12 · Oct 31, 2023 · Updated 2 years ago
- A podcast transcription service built on Azure that transcribes any new episode of your podcast and displays synchronized transcripts alo… ☆10 · Dec 10, 2022 · Updated 3 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster ☆11 · Jul 7, 2021 · Updated 4 years ago
- My solutions to exercises from various SQL learning courses and platforms ☆12 · Jun 23, 2021 · Updated 4 years ago
- ☆15 · Jul 20, 2023 · Updated 2 years ago
- This repository contains solutions to problems from online SQL practice sites: sql-ex.ru learndb.ru hackerrank.com pgexercises.… ☆10 · Jul 19, 2023 · Updated 2 years ago
- My professional portfolio with some of my best data science projects. ☆11 · Jun 22, 2017 · Updated 8 years ago
- This repository contains tasks on how to build an ETL pipeline for the online transaction data of an e-commerce company. ☆18 · Jun 27, 2023 · Updated 2 years ago
- "C" APIs for HBase ☆11 · Dec 17, 2014 · Updated 11 years ago
- ☆11 · Sep 23, 2019 · Updated 6 years ago
- A curated list of awesome Deep Learning tutorials, projects and communities. ☆11 · Oct 28, 2015 · Updated 10 years ago
- JP morgan virtual internship Quantitative Research ☆21 · Dec 24, 2023 · Updated 2 years ago
- ☆17 · Mar 26, 2023 · Updated 2 years ago
- A project to develop a fully distributed MapReduce library for Haskell which makes using the MapReduce framework totally transparent for … ☆20 · Nov 12, 2011 · Updated 14 years ago
- Jupyter Notebook showing how to process Telecom datasets using PySpark (SparkSQL and DataFrames) and plotting the results using Matplotli… ☆16 · Dec 3, 2018 · Updated 7 years ago
- Solutions to acmp.ru problems / Language: C++11 ☆10 · Jun 13, 2020 · Updated 5 years ago
- Repo will try to cover all the most frequently used ML algos with proper explanation and examples ☆10 · Apr 14, 2019 · Updated 6 years ago
- This project would demonstrate the following capabilities: 1. Extraction Loading and Transformation of S&P 500 data and company fundament… ☆14 · Sep 26, 2021 · Updated 4 years ago