Simple ETL pipeline using Python
☆29 · May 22, 2023 · Updated 3 years ago
Alternatives and similar repositories for etljob
Users that are interested in etljob are comparing it to the libraries listed below.
- ETL using Python in Jupyter Notebook, loading CSV, cleaning data, and saving to SQL Database. ☆14 · Nov 17, 2020 · Updated 5 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker. ☆114 · Jan 8, 2026 · Updated 5 months ago
- Example repo to create end to end tests for data pipeline. ☆25 · Jun 14, 2024 · Updated 2 years ago
- Swarming behaviour is based on aggregation of simple drones exhibiting basic instinctive reactions to stimuli. However, to achieve overal… ☆11 · Dec 2, 2019 · Updated 6 years ago
- Example end to end data engineering project. ☆1,411 · Dec 8, 2022 · Updated 3 years ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash. ☆15 · Jun 26, 2023 · Updated 2 years ago
- simple ETL example ☆16 · Jun 1, 2020 · Updated 6 years ago
- Pyspark Spotify ETL ☆17 · Aug 19, 2021 · Updated 4 years ago
- Automated ML pipeline with Python, Docker, Luigi, SciKit-Learn and Pandas to predict wine quality ratings ☆18 · May 30, 2020 · Updated 6 years ago
- NoSQL extract, transform, load (ETL) toolkit with Python ☆16 · Jun 11, 2026 · Updated last week
- Apache Spark using SQL ☆14 · Aug 18, 2021 · Updated 4 years ago
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc… ☆24 · Nov 19, 2024 · Updated last year
- Price Crawler - Tracking Price Inflation ☆205 · Jun 23, 2020 · Updated 5 years ago
- Personal Finance Project to automatically collect Swiss banking transaction into a DWH and visualise it ☆26 · Mar 24, 2026 · Updated 2 months ago
- This repository contains tasks on how to build an ETL pipeline for the online transaction data of an e-commerce company. ☆18 · Jun 27, 2023 · Updated 2 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering. ☆19 · Feb 19, 2023 · Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project ☆24 · Jul 14, 2022 · Updated 3 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which… ☆104 · Sep 26, 2025 · Updated 8 months ago
- Machine Learning for Internet of Things ☆12 · Jul 24, 2019 · Updated 6 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour… ☆18 · Aug 14, 2025 · Updated 10 months ago
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P… ☆32 · Apr 2, 2023 · Updated 3 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary. ☆29 · Aug 14, 2023 · Updated 2 years ago
- A simple to use 4 polyphonic wavetable synthesizer library for Arduino. ☆14 · Feb 11, 2017 · Updated 9 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard ☆18 · Mar 31, 2024 · Updated 2 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri… ☆26 · Feb 9, 2021 · Updated 5 years ago
- ⚡️ Pandas dataframes with object oriented programming style (not maintained) ☆11 · Mar 17, 2024 · Updated 2 years ago
- Spark Structured Streaming data pipeline that processes movie ratings data in real-time. ☆14 · Apr 15, 2026 · Updated 2 months ago
- 🌟 An end-to-end full-stack data science project, including modelling, MLOps, and data storytelling. ✨ ☆16 · Aug 30, 2025 · Updated 9 months ago
- ☆16 · Jul 15, 2023 · Updated 2 years ago
- Hobbyist OpenGL from Python ☆15 · Jul 28, 2023 · Updated 2 years ago
- Portfolio of projects and studies conducted in data engineering. ☆34 · Feb 22, 2025 · Updated last year
- Data Pipeline from the Global Historical Climatology Network DataSet ☆27 · Dec 20, 2022 · Updated 3 years ago
- Step by step instructions to create a production-ready data pipeline ☆62 · Dec 23, 2024 · Updated last year
- A simple peristaltic pump, with bearings and 3d printed parts for a NEMA17 ☆16 · Oct 2, 2022 · Updated 3 years ago
- My final project for the Data Engineering Zoomcamp by DataTalksClub. ☆10 · Apr 6, 2023 · Updated 3 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets. ☆12 · Jul 16, 2019 · Updated 6 years ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide ☆804 · Mar 10, 2026 · Updated 3 months ago
- Repository for Spark using Python material. It is popularly known as PySpark. ☆20 · Aug 18, 2021 · Updated 4 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme ☆29 · Apr 12, 2023 · Updated 3 years ago