ryanchao2012 / airfly
Auto Generate Airflow's dag.py On The Fly
☆9Updated this week
Alternatives and similar repositories for airfly:
Users that are interested in airfly are comparing it to the libraries listed below
- Asynchronous actions for PySpark☆47Updated 3 years ago
- Nuclio Function Automation for Python and Jupyter☆84Updated 2 months ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆49Updated last year
- Code Repository for the EVO-ODAS☆31Updated 7 years ago
- python library for automated dataset normalization☆113Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated 4 months ago
- The deepr module provide abstractions (layers, readers, prepro, metrics, config) to help build tensorflow models on top of tf estimators☆52Updated last year
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- ☆11Updated 5 years ago
- Gather system information about airflow processes☆18Updated 4 years ago
- Data Catalog for Databases and Data Warehouses☆32Updated last year
- dagster scikit-learn pipeline example.☆44Updated last year
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆123Updated 3 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- A containerized approach using Apache Kafka, Spark, Cassandra, Hive, Jupyter, and Docker-compose.☆14Updated 3 years ago
- A library for Spark DataFrame using MinIO Select API☆97Updated 5 years ago
- Machine Learning Projects with Flytekit☆35Updated last year
- Helm charts for Dask☆92Updated this week
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated 2 years ago
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated 2 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated this week
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 2 years ago
- A cloud native data mesh implementation☆12Updated 4 years ago
- A curated list of dagster code snippets for data engineers☆53Updated 11 months ago
- Demo of an In-database processing tool for scikit-learn☆13Updated 2 years ago
- ☆30Updated 3 years ago