A repo to track data engineering projects
☆13Nov 11, 2022Updated 3 years ago
Alternatives and similar repositories for dataEngineering
Users that are interested in dataEngineering are comparing it to the libraries listed below
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Nov 22, 2021Updated 4 years ago
- Project based learning for Data Engineering fundamentals.☆13Jan 15, 2021Updated 5 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Apr 29, 2021Updated 4 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 6 months ago
- Built data models, data warehouses, and data lakes, automated data pipelines, and worked with massive datasets.☆13Jul 16, 2019Updated 6 years ago
- ☆23Nov 8, 2022Updated 3 years ago
- A data engineering project with Airflow, dbt, Terraform, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25May 6, 2023Updated 2 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3☆32Feb 2, 2021Updated 5 years ago
- Big Data Engineering & Analytics Project☆36Nov 6, 2020Updated 5 years ago
- This repo has some proposed agendas for Azure Machine Learning related hands-on workshops.☆11Feb 2, 2021Updated 5 years ago
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- Python library for the simulation of probabilistic circuits.☆11Feb 1, 2026Updated last month
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- Framework for studying cryptographic hash functions using SAT.☆10Dec 21, 2021Updated 4 years ago
- Simple Python script that converts all Excel files (xls, xlsx, xlsm, csv) in a directory into xlsb files.☆10Mar 13, 2023Updated 2 years ago
- Some example projects for Data Engineers to build, end-to-end.☆38Nov 8, 2023Updated 2 years ago
- This is the end-to-end MLOps project I built while participating in the MLOps Zoomcamp☆10Sep 11, 2022Updated 3 years ago
- CSC 424 Advanced Database Management Systems☆16Jan 1, 2020Updated 6 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- This repo contains example code used for golang training☆10Feb 19, 2023Updated 3 years ago
- This repository aims to onboard new users into Modeling in SAP Data Warehouse Cloud in the most practical manner. For that you will build…☆17Feb 2, 2024Updated 2 years ago
- Creating character network graphs☆39May 15, 2020Updated 5 years ago
- An experimental attempt to make a CLI for supply-chain modeling for Helpful Engineering's Project Data☆10Oct 29, 2023Updated 2 years ago
- Generate fake data for any purpose☆10Dec 21, 2020Updated 5 years ago
- My personal website☆11Jan 31, 2026Updated last month
- Some Monte Carlo algorithms for the estimation of small probabilities associated with rare events☆11Aug 16, 2023Updated 2 years ago
- Swarming behaviour is based on aggregation of simple drones exhibiting basic instinctive reactions to stimuli. However, to achieve overal…☆12Dec 2, 2019Updated 6 years ago
- Sliding Puzzle solver and utilities☆10Jan 21, 2024Updated 2 years ago
- Introduction to network analysis and visualization☆12Apr 6, 2024Updated last year
- ☆10May 24, 2021Updated 4 years ago
- ☆10Feb 25, 2023Updated 3 years ago
- ☆10May 5, 2022Updated 3 years ago
- Business Rules Integration Engine☆11Updated this week
- Business challenge that requires building a data platform for retailer data analytics.☆17Feb 19, 2023Updated 3 years ago