pawl / awesome-etl
A curated list of awesome ETL frameworks, libraries, and software.
☆3,397Updated 9 months ago
Alternatives and similar repositories for awesome-etl:
Users that are interested in awesome-etl are comparing it to the libraries listed below
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,083Updated last year
- Curated list of resources about Apache Airflow☆3,773Updated 8 months ago
- ETL best practices with airflow, with examples☆1,332Updated 7 months ago
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,352Updated last month
- A curated list of data engineering tools for software developers☆7,346Updated 2 weeks ago
- Actively curated list of awesome BI tools. PRs welcome!☆2,159Updated 8 months ago
- Extract Transform Load for Python 3.5+☆1,589Updated last year
- a curated list of awesome streaming frameworks, applications, etc☆2,808Updated last month
- Python Extract Transform and Load Tables of Data☆1,266Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,566Updated last week
- A list of useful resources to learn Data Engineering from scratch☆3,774Updated 10 months ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,507Updated 5 months ago
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.☆1,373Updated 5 years ago
- A curated list of data engineering tools for software developers☆482Updated 7 years ago
- This repository is a getting started guide to Singer.☆1,300Updated 8 months ago
- An Awesome List of Open-Source Data Engineering Projects☆2,506Updated 7 months ago
- A list of useful Apache NiFi resources, processor bundles and tools☆953Updated 4 years ago
- Guides and docs to help you get up and running with Apache Airflow.☆808Updated 2 years ago
- Web-based SQL editor. Legacy project in maintenance mode.☆5,119Updated last month
- Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized co…☆1,771Updated 4 years ago
- A curated list of awesome big data frameworks, ressources and other awesomeness.☆13,594Updated 2 months ago
- Official repository for pygrametl - ETL programming in Python☆295Updated last week
- A curated list of awesome Apache Spark packages and resources.☆1,793Updated 6 months ago
- Docker Apache Airflow☆3,803Updated 2 years ago
- Apache Pinot - A realtime distributed OLAP datastore☆5,764Updated this week
- High-performance time-series aggregation for PostgreSQL☆2,642Updated 3 years ago
- Construct Apache Airflow DAGs Declaratively via YAML configuration files☆1,286Updated last week
- A series of DAGs/Workflows to help maintain the operation of Airflow☆1,722Updated 10 months ago
- Data-Centric Pipelines and Data Versioning☆6,223Updated 3 months ago
- Dremio - the missing link in modern data☆1,427Updated last week