BasPH / airflow-rocket
Airflow code accompanying blog post.
☆21 Updated 6 years ago
Alternatives and similar repositories for airflow-rocket:
Users who are interested in airflow-rocket also compare it to the libraries listed below.
- Composable filesystem hooks and operators for Apache Airflow. ☆17 Updated 3 years ago
- A Getting Started Guide for developing and using Airflow Plugins ☆93 Updated 6 years ago
- Airflow workflow management platform chef cookbook. ☆71 Updated 5 years ago
- Export Airflow metrics (from mysql) in prometheus format ☆29 Updated last week
- event-triggered plugins for airflow ☆21 Updated 5 years ago
- ☆24 Updated 5 years ago
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t… ☆25 Updated 3 years ago
- Udacity Data Pipeline Exercises ☆15 Updated 4 years ago
- ☆19 Updated last month
- Spark Application UI extension for JupyterLab ☆10 Updated 3 years ago
- Skeleton project for Apache Airflow training participants to work on. ☆16 Updated 4 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach… ☆19 Updated 8 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- An example PySpark project with pytest ☆16 Updated 7 years ago
- How to do data science with Optimus, Spark and Python. ☆19 Updated 5 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated last year
- Pylint plugin for static code analysis on Airflow code ☆93 Updated 4 years ago
- ☕⛵WIP PySpark dependency management ☆22 Updated 6 years ago
- Code that was used as an example during the Data+AI Summit 2020 ☆15 Updated 4 years ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 Updated last year
- pytest support for airflow ☆12 Updated 4 years ago
- Using the Parquet file format with Python ☆15 Updated last year
- CLI tool to launch Spark jobs on AWS EMR ☆67 Updated last year
- A pyspark lib to validate data quality ☆18 Updated 2 years ago
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- An extension for Jupyter notebooks that allows running notebooks inside a Docker container and converting them to runnable Docker images. ☆28 Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 Updated 3 months ago
- Examples for using Amazon SageMaker components in Kubeflow Pipelines ☆22 Updated 4 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- Projects developed by Domino's R&D team ☆76 Updated 3 years ago