An example of an ETL pipeline that lays out generic DE processes. This is now out of date but still provides useful information
☆29Apr 22, 2022Updated 3 years ago
Alternatives and similar repositories for etl-pipeline-example
Users that are interested in etl-pipeline-example are comparing it to the libraries listed below
Sorting:
- A JS library and Svelte component that implements and visualizes a self-organizing map (SOM)☆14Apr 23, 2025Updated 10 months ago
- Dask on ECS Fargate☆14Sep 23, 2019Updated 6 years ago
- Find keyword in Twitter User followers/following☆10Aug 27, 2024Updated last year
- The repository for the NICAR 2024 class, SELECT * FROM interesting☆17Feb 2, 2024Updated 2 years ago
- ☆42Jan 11, 2024Updated 2 years ago
- This repo includes all exercises for courses and projects that I have finished on datacamp.☆20Feb 25, 2026Updated 3 weeks ago
- This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow or…☆19Aug 21, 2025Updated 7 months ago
- A python package to create a database on the platform using our moj data warehousing framework☆21Updated this week
- Example code that launches a docker container on AWS Fargate from AWS Lambda☆18Dec 24, 2017Updated 8 years ago
- ☆22Jul 14, 2020Updated 5 years ago
- A super simple way to do distributed hyperparameter tuning with Keras and Mongo☆30Apr 16, 2017Updated 8 years ago
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon …☆18Aug 25, 2021Updated 4 years ago
- SnowShu is a sampling engine designed to support testing in data development.☆12Aug 26, 2025Updated 6 months ago
- A data pipeline helper written in node to convert a folder of JS/ArchieML/JSON/YAML/CSV/TSV files into usable data.☆47Sep 4, 2023Updated 2 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Sep 1, 2018Updated 7 years ago
- Machine Learning algorithms implementation in Python from scratch.☆11Feb 10, 2019Updated 7 years ago
- FInal project for data zoom camp 2024☆16Mar 31, 2024Updated last year
- Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-…☆34Sep 14, 2017Updated 8 years ago
- Examples of causality maps for time series driven by GitHub actions☆15Nov 3, 2023Updated 2 years ago
- Python package for Plotly/Dash apps with support for multi-page, modules, and new charts such as Pareto with an Object Orient Approach☆20Aug 5, 2022Updated 3 years ago
- LSPosed module to send Monsieur Cuisine data to MQTT for Homeassistant.☆25Jul 10, 2025Updated 8 months ago
- Repository for code examples from my youtube channel and medium articles working with data in python on AWS☆28Feb 5, 2024Updated 2 years ago
- ☆21Mar 31, 2024Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 2 years ago
- EDB Reference Architectures☆16Jan 18, 2023Updated 3 years ago
- ☆39Aug 4, 2021Updated 4 years ago
- Documentation for Ploomber Cloud☆37Oct 1, 2025Updated 5 months ago
- ☆33Jan 8, 2021Updated 5 years ago
- A small tool for embedding files in a Go source file.☆11Nov 3, 2020Updated 5 years ago
- AWS Certified Solutions Architect Professional SAP-C01 New Feb 2019 Version Exam Notes☆17Apr 6, 2019Updated 6 years ago
- Pandas equivalent, high-performance dataframe library☆27Nov 11, 2025Updated 4 months ago
- ☆13May 22, 2023Updated 2 years ago
- A simple IP lookup tool written in Python with concurrency support.☆16Jul 12, 2025Updated 8 months ago
- A cool simple example of functional data engineering☆34Mar 13, 2023Updated 3 years ago
- Udacity's 5 Month Data Engineering Nanodegree program. This repo includes all the projects completed.☆27May 31, 2020Updated 5 years ago
- ☆10Nov 5, 2016Updated 9 years ago
- For when Safari goes wrong☆21Aug 9, 2014Updated 11 years ago
- Using MLflow with a Docker Environment☆19Sep 17, 2020Updated 5 years ago
- Microsynthesis using quasirandom sampling and/or IPF☆18Feb 14, 2026Updated last month