A curated list of data engineering tools for software developers
☆503Jun 23, 2017Updated 8 years ago
Alternatives and similar repositories for awesome-data-engineering
Users that are interested in awesome-data-engineering are comparing it to the libraries listed below
Sorting:
- A curated list of data engineering tools for software developers☆8,366Feb 21, 2026Updated 2 weeks ago
- Curated list of resources about Apache Airflow☆3,896Jan 30, 2026Updated last month
- A Github API client to extract events and actions, and load into a database☆28Oct 22, 2021Updated 4 years ago
- ☆201Oct 10, 2023Updated 2 years ago
- ETL best practices with airflow, with examples☆1,352Sep 25, 2024Updated last year
- Airflow training for the crunch conf☆105Oct 31, 2018Updated 7 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆259Jul 19, 2023Updated 2 years ago
- dbt plugin for Palm CLI☆20Mar 20, 2024Updated last year
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆44,510Updated this week
- Always know what to expect from your data.☆11,224Updated this week
- Sample Faust project to process tweets in real-time☆13Mar 29, 2021Updated 4 years ago
- Single Sign On System for SPARCS☆19Feb 10, 2026Updated last month
- Apache Airflow CI pipeline☆19Jun 12, 2019Updated 6 years ago
- A series of DAGs/Workflows to help maintain the operation of Airflow☆1,770Jun 18, 2024Updated last year
- The Data Engineering Cookbook☆14,977Jan 17, 2026Updated last month
- Docker Apache Airflow☆3,808Mar 1, 2023Updated 3 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Aug 17, 2022Updated 3 years ago
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,746Updated this week
- Automatically creates dbt exposures from your BI tools. It currently supports Tableau (connecting to Snowflake).☆62Jan 25, 2024Updated 2 years ago
- Tutorial for interacting with Google Cloud Storage via the Python SDK.☆24Mar 4, 2026Updated last week
- A list of useful resources to learn Data Engineering from scratch☆3,960Jun 19, 2024Updated last year
- ☆10Jun 30, 2022Updated 3 years ago
- Wiki and snippets in web stack architecture (Especially for Django and AWS)☆11Feb 18, 2019Updated 7 years ago
- Palm CLI - the tool-belt for data teams☆47Mar 20, 2024Updated last year
- Example orchestration pipeline for Fivetran + dbt managed by Airflow☆22Feb 18, 2021Updated 5 years ago
- ☆25Feb 14, 2025Updated last year
- Apache Superset is a Data Visualization and Data Exploration Platform☆70,860Updated this week
- Power BI Custom Connector for loading tables directly from Tabular Data Packages (Frictionless Data) into Power BI☆10Jun 16, 2020Updated 5 years ago
- create buckets for terraform tfstate files and set cross-region replication.☆12Sep 17, 2017Updated 8 years ago
- Use SQL to instantly query plugin metadata from the Steampipe Hub. Open source CLI. No DB required.☆12Feb 9, 2026Updated last month
- Skeleton project for Apache Airflow training participants to work on.☆17Jul 9, 2020Updated 5 years ago
- dbt + steampipe playground☆10Nov 5, 2022Updated 3 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆39Nov 15, 2022Updated 3 years ago
- Kubernetes Fundamentals Book☆14Feb 5, 2019Updated 7 years ago
- Homebrew Tap for argo☆18Aug 28, 2025Updated 6 months ago
- Authentication and CORS helper for Python cloud functions☆11Jun 12, 2019Updated 6 years ago
- Episode 147 - Continuous Integration and Delivery with Cloud Build + Firebase☆17Jan 18, 2019Updated 7 years ago
- Run dbt serverless in the Cloud (AWS)☆43Jan 20, 2020Updated 6 years ago
- A Helm chart to install Apache Airflow on Kubernetes☆295Mar 4, 2026Updated last week