Airflow training for the crunch conf
☆105Oct 31, 2018Updated 7 years ago
Alternatives and similar repositories for airflow-training
Users that are interested in airflow-training are comparing it to the libraries listed below.
- Curated list of resources about Apache Airflow☆3,902Apr 20, 2026Updated 2 weeks ago
- Visualize dependencies between Airflow DAGs☆49May 7, 2021Updated 4 years ago
- Airflow basics tutorial☆397Sep 1, 2021Updated 4 years ago
- Airflow Unit Tests and Integration Tests☆262Nov 16, 2022Updated 3 years ago
- ☆29Sep 30, 2020Updated 5 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆347Jul 24, 2018Updated 7 years ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary.☆16Jan 6, 2021Updated 5 years ago
- Demo setup for the PyCon 2017 keynote.☆20Sep 2, 2017Updated 8 years ago
- MIT xPRO data science course☆12Jul 26, 2018Updated 7 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆234Aug 24, 2022Updated 3 years ago
- Thai translation of the book "Interpretable Machine Learning" by Christoph Molnar…☆15Oct 15, 2021Updated 4 years ago
- Market basket recommendations using association rules and apriori☆11Aug 25, 2018Updated 7 years ago
- A repository to store recipes, custom sources, transformations and other things to make your DataHub experience magical☆12Sep 23, 2022Updated 3 years ago
- Apache Airflow tutorial☆974Nov 3, 2022Updated 3 years ago
- An example MLFlow project☆52Jan 10, 2025Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆88Dec 8, 2022Updated 3 years ago
- Turbine: the bare metals that gets you Airflow☆380Oct 10, 2021Updated 4 years ago
- ☆10Dec 30, 2024Updated last year
- A curated list of data engineering tools for software developers☆506Jun 23, 2017Updated 8 years ago
- A simple dashboard for monitoring your aws codepipelines.☆11May 8, 2018Updated 7 years ago
- Materials used in class when teaching Data Bootcamp at NYU Stern.☆26May 1, 2018Updated 8 years ago
- Bare minimal Airflow on Kubernetes (Local, EKS, AKS)☆53Mar 4, 2020Updated 6 years ago
- ☆10Jan 28, 2025Updated last year
- This repository focuses on providing interview scenario questions that I have encountered during interviews. The questions are designed t…☆50Feb 11, 2025Updated last year
- Fake Pandas / PySpark DataFrame creator☆48Mar 10, 2024Updated 2 years ago
- Code for Data Pipelines with Apache Airflow☆819Aug 15, 2024Updated last year
- Basic tutorial of using Apache Airflow☆36Sep 25, 2018Updated 7 years ago
- Make simple storing test results and visualisation of these in a BI dashboard☆54Mar 26, 2026Updated last month
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu…☆76Oct 30, 2018Updated 7 years ago
- Repository with files from the meetups we hold at http://www.meetup.com/Quebrando-a-cabeca-no-Kaggle/☆18Dec 4, 2015Updated 10 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆176Apr 13, 2026Updated 3 weeks ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Nov 22, 2021Updated 4 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- This is a public repository that the dbt proserv team uses for collective demos.☆15Mar 20, 2026Updated last month
- Construct Apache Airflow DAGs Declaratively via YAML configuration files☆1,428Updated this week
- Ansible scripts for deploying Kafka on EC2☆10Oct 7, 2016Updated 9 years ago
- All the code developed in the "Creating Google Cloud Pub/Sub publishers and subscribers with Spring Cloud GCP" article.☆10May 25, 2023Updated 2 years ago