pawl / awesome-etl
A curated list of awesome ETL frameworks, libraries, and software.
☆3,319Updated 5 months ago
Alternatives and similar repositories for awesome-etl:
Users that are interested in awesome-etl are comparing it to the libraries listed below
- Curated list of resources about Apache Airflow☆3,722Updated 4 months ago
- ETL best practices with airflow, with examples☆1,313Updated 3 months ago
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,241Updated last month
- Actively curated list of awesome BI tools. PRs welcome!☆2,115Updated 4 months ago
- Python Extract Transform and Load Tables of Data☆1,254Updated 8 months ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,081Updated last year
- A curated list of data engineering tools for software developers☆6,964Updated 2 months ago
- a curated list of awesome streaming frameworks, applications, etc☆2,742Updated 2 weeks ago
- Extract Transform Load for Python 3.5+☆1,589Updated last year
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,474Updated last week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,024Updated this week
- Data-Centric Pipelines and Data Versioning☆6,197Updated this week
- Guides and docs to help you get up and running with Apache Airflow.☆804Updated 2 years ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,499Updated 4 months ago
- A list of useful Apache NiFi resources, processor bundles and tools☆948Updated 4 years ago
- The leader in Next-Generation Customer Data Infrastructure☆6,868Updated 4 months ago
- A series of DAGs/Workflows to help maintain the operation of Airflow☆1,697Updated 7 months ago
- A curated list of data engineering tools for software developers☆465Updated 7 years ago
- ☆1,619Updated this week
- This repository is a getting started guide to Singer.☆1,282Updated 4 months ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,488Updated last month
- Dynamically generate Apache Airflow DAGs from YAML configuration files☆1,231Updated this week
- [NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis☆1,487Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆1,973Updated this week
- A curated list of awesome big data frameworks, ressources and other awesomeness.☆13,382Updated 8 months ago
- [NOT MAINTAINED] Bubbles – Python ETL framework☆453Updated 7 years ago
- Docker Apache Airflow☆3,790Updated last year
- An orchestration platform for the development, production, and observation of data assets.☆12,300Updated this week
- Pinball is a scalable workflow manager☆1,045Updated 5 years ago
- NumPy and Pandas interface to Big Data☆3,190Updated last year