A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator
☆73Sep 20, 2019Updated 6 years ago
Alternatives and similar repositories for airflow-spark-operator-plugin
Users that are interested in airflow-spark-operator-plugin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆176May 28, 2025Updated 10 months ago
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆326Dec 16, 2020Updated 5 years ago
- A plugin to Apache Airflow to allow you to run Zip and UnZip commands as an Operator☆12Jul 26, 2023Updated 2 years ago
- An Airflow Plugin that provides a new page to the standard Airflow Web Server to help you perform various operations☆12Nov 28, 2016Updated 9 years ago
- Ansible role to deploy and configure Airflow☆41Mar 31, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Oct 17, 2017Updated 8 years ago
- Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js☆11Dec 6, 2015Updated 10 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Nov 18, 2019Updated 6 years ago
- Airflow configuration for Telemetry☆201Updated this week
- Airflow declarative DAGs via YAML☆133Sep 18, 2023Updated 2 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆234Aug 24, 2022Updated 3 years ago
- A demo repo for using keylines with IBM Graph☆11Dec 20, 2016Updated 9 years ago
- Example for an airflow plugin☆49Jul 19, 2016Updated 9 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Jan 17, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Experiments produced during an end of studies project (ETS, H2018)☆14Nov 21, 2018Updated 7 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Mar 16, 2016Updated 10 years ago
- presto's elasticsearch connector☆11Dec 7, 2016Updated 9 years ago
- Dockerfile for a base Logstash image to be extended by others (allow to install plug-ins, change configuration, etc.)☆10Jan 16, 2017Updated 9 years ago
- A focused web crawler based on Playwright, RMQ, Kafka and Flink.☆14Feb 4, 2021Updated 5 years ago
- Kong Serverless Plugins - this plugin has been moved into https://github.com/Kong/kong, please open issues and PRs in that repo☆17Sep 1, 2021Updated 4 years ago
- A pivot table plugin for Kibana 5☆24Aug 17, 2018Updated 7 years ago
- Music Recommendation using Python + Spark☆23Oct 14, 2020Updated 5 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Sep 17, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Resize image on the fly using flask, zappa, pillow, opencv-python☆18Aug 7, 2017Updated 8 years ago
- Source code for SIMD benchmarks and experiments in Java☆32Jun 30, 2017Updated 8 years ago
- A HDFS-backed ContentsManager implementation for IPython☆24Nov 13, 2024Updated last year
- ETL best practices with airflow, with examples☆1,352Sep 25, 2024Updated last year
- ☆21Mar 13, 2020Updated 6 years ago
- Functional Airflow DAG definitions.☆38Jul 4, 2017Updated 8 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- Airflow-Salesforce connector☆16Jul 5, 2017Updated 8 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Oct 30, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Few things we've met during our etl project based on spark☆24Mar 22, 2018Updated 8 years ago
- A library for querying Druid data sources with Apache Spark☆23Oct 28, 2020Updated 5 years ago
- A plugin for Airflow that create and manage your DAG with web UI.☆20Nov 28, 2017Updated 8 years ago
- Hadoop InputFormat for http://druid.io/☆10Oct 26, 2016Updated 9 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆259Jul 19, 2023Updated 2 years ago
- Python library for sending data to Honeycomb☆20Nov 25, 2024Updated last year
- Turbine: the bare metals that gets you Airflow☆378Oct 10, 2021Updated 4 years ago