Productionalizing Data Pipelines with Apache Airflow
☆116Jun 18, 2022Updated 3 years ago
Alternatives and similar repositories for productionalizing-data-pipelines-airflow
Users that are interested in productionalizing-data-pipelines-airflow are comparing it to the libraries listed below
Sorting:
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- ☆178Dec 8, 2022Updated 3 years ago
- ☆12Mar 17, 2022Updated 4 years ago
- ☆11Nov 26, 2024Updated last year
- This is where we put useful code for our daily job with data.☆27Mar 19, 2025Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- A tool for customizing and dynamically generating a tabular model from an existing tabular model.☆24Sep 16, 2022Updated 3 years ago
- ☆10May 5, 2022Updated 3 years ago
- ☆19Mar 18, 2021Updated 5 years ago
- Local AWS EMR - A local service that imitates AWS EMR☆27Jul 5, 2023Updated 2 years ago
- AWS API Gateway Security Deep dive☆22Apr 7, 2021Updated 4 years ago
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Nov 16, 2021Updated 4 years ago
- ☆23Nov 17, 2019Updated 6 years ago
- Various useful data structures in Python☆39Nov 14, 2019Updated 6 years ago
- Spark cluster in docker containers with sample training Jupyter notebooks☆27Feb 24, 2023Updated 3 years ago
- Code for Data Pipelines with Apache Airflow☆815Aug 15, 2024Updated last year
- Running a flask app and celery worker in the same docker container.☆31Feb 20, 2020Updated 6 years ago
- ☆11May 26, 2022Updated 3 years ago
- ☆17Nov 26, 2024Updated last year
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆27Mar 1, 2026Updated 2 weeks ago
- Code and data for Veridicality classifier on Twitter☆11May 23, 2018Updated 7 years ago
- Demo for GitHub Universe 2022☆13Jan 31, 2023Updated 3 years ago
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- Demo that extends the FastUI example & adds database persistence☆16Jan 2, 2024Updated 2 years ago
- Airflow Tutorials☆25Feb 28, 2021Updated 5 years ago
- ☆17Aug 19, 2022Updated 3 years ago
- NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)☆16May 20, 2023Updated 2 years ago
- advent of code with SQL☆12Dec 16, 2020Updated 5 years ago
- Repo that will help you explore how to build a hybrid workflow using Apache Airflow and Amazon ECS Anywhere☆11Jul 12, 2022Updated 3 years ago
- Stream Processing Workshop☆23Jan 26, 2026Updated last month
- Proyecto de juguete para mostrar cómo realizar el setup de un proyecto de data science☆11Nov 24, 2022Updated 3 years ago
- ☆10Jul 14, 2022Updated 3 years ago
- Change Data Capture (CDC) from PostgreSQL to ClickHouse☆16Jul 15, 2024Updated last year
- Collection of Snowflake Scripting procedures extending GET_DDL function by dwh.dev.☆15Jul 23, 2024Updated last year
- Indicator chart plugin for Apache Superset☆14Nov 21, 2022Updated 3 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 2 years ago
- ☆13Nov 8, 2022Updated 3 years ago
- A Simple React App using Express Backend and Interacting with BigQuery: Most Viewed StackOverflow Questions☆17May 10, 2018Updated 7 years ago