damklis / etljobView external linksLinks
Simple ETL pipeline using Python
☆29May 22, 2023Updated 2 years ago
Alternatives and similar repositories for etljob
Users that are interested in etljob are comparing it to the libraries listed below
Sorting:
- Swarming behaviour is based on aggregation of simple drones exhibiting basic instinctive reactions to stimuli. However, to achieve overal…☆12Dec 2, 2019Updated 6 years ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆17Feb 19, 2023Updated 2 years ago
- In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related info…☆12Sep 9, 2023Updated 2 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆108Jan 8, 2026Updated last month
- Example end to end data engineering project.☆1,384Dec 8, 2022Updated 3 years ago
- Price Crawler - Tracking Price Inflation☆190Jun 23, 2020Updated 5 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 6 months ago
- End-to-end ELT data engineering project☆22Dec 24, 2022Updated 3 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Jul 16, 2019Updated 6 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Aug 18, 2021Updated 4 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆25Feb 9, 2021Updated 5 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆104Sep 26, 2025Updated 4 months ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆28Apr 12, 2023Updated 2 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23May 14, 2022Updated 3 years ago
- Beginner data engineering project - batch edition☆564Jan 22, 2025Updated last year
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 3, 2024Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 2 years ago
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆31Apr 2, 2023Updated 2 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- Portfolio of projects and studies conducted in data engineering.☆34Feb 22, 2025Updated 11 months ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- ☆16Oct 8, 2025Updated 4 months ago
- Core Java Basic Program☆17Oct 30, 2025Updated 3 months ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.☆11Mar 19, 2024Updated last year
- Final Project for Data Engineering Zoomcamp Course 2024 🧙 🔥☆11Apr 17, 2024Updated last year
- Python library for the simulation of probabilistic circuits.☆11Feb 1, 2026Updated 2 weeks ago
- Framework for studying cryptographic hash functions using SAT.☆10Dec 21, 2021Updated 4 years ago
- simple ETL example☆15Jun 1, 2020Updated 5 years ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆767Sep 3, 2024Updated last year
- A cold start recommender system for European travel destinations.☆10Dec 4, 2022Updated 3 years ago
- ☆10Jul 21, 2022Updated 3 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated 10 months ago