A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator
☆73Sep 20, 2019Updated 6 years ago
Alternatives and similar repositories for airflow-spark-operator-plugin
Users that are interested in airflow-spark-operator-plugin are comparing it to the libraries listed below.
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆176Apr 13, 2026Updated last month
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆325Dec 16, 2020Updated 5 years ago
- A plugin to Apache Airflow to allow you to run Zip and UnZip commands as an Operator☆12Jul 26, 2023Updated 2 years ago
- An Airflow Plugin that provides a new page to the standard Airflow Web Server to help you perform various operations☆12Nov 28, 2016Updated 9 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Oct 17, 2017Updated 8 years ago
- Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js☆11Dec 6, 2015Updated 10 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Nov 18, 2019Updated 6 years ago
- Airflow configuration for Telemetry☆202May 11, 2026Updated last week
- Airflow declarative DAGs via YAML☆133Sep 18, 2023Updated 2 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆234Aug 24, 2022Updated 3 years ago
- A demo repo for using keylines with IBM Graph☆11Dec 20, 2016Updated 9 years ago
- Example for an airflow plugin☆49Jul 19, 2016Updated 9 years ago
- A library for reading data from Amazon S3 with optimised listing using Amazon SQS using Spark SQL Streaming (or Structured Streaming).☆18Apr 20, 2024Updated 2 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Jan 17, 2021Updated 5 years ago
- Airflow workflow management platform chef cookbook.☆70Jul 11, 2019Updated 6 years ago
- Experiments produced during an end of studies project (ETS, H2018)☆14Nov 21, 2018Updated 7 years ago
- presto's elasticsearch connector☆11Dec 7, 2016Updated 9 years ago
- Dockerfile for a base Logstash image to be extended by others (allow to install plug-ins, change configuration, etc.)☆10Jan 16, 2017Updated 9 years ago
- Kong Serverless Plugins - this plugin has been moved into https://github.com/Kong/kong, please open issues and PRs in that repo☆17Sep 1, 2021Updated 4 years ago
- A pivot table plugin for Kibana 5☆24Aug 17, 2018Updated 7 years ago
- Music Recommendation using Python + Spark☆23Oct 14, 2020Updated 5 years ago
- Resize image on the fly using flask, zappa, pillow, opencv-python☆18Aug 7, 2017Updated 8 years ago
- Source code for SIMD benchmarks and experiments in Java☆32Jun 30, 2017Updated 8 years ago
- A HDFS-backed ContentsManager implementation for IPython☆23Nov 13, 2024Updated last year
- ☆30Jun 18, 2017Updated 8 years ago
- Functional Airflow DAG definitions.☆38Jul 4, 2017Updated 8 years ago
- UWP custom Pivot control with tab-like headers☆12Oct 15, 2015Updated 10 years ago
- cli wrapper for Teradata data warehouse utilities (BTEQ,etc..)☆24Mar 5, 2012Updated 14 years ago
- Delta Lake Examples☆11Apr 24, 2020Updated 6 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Oct 30, 2018Updated 7 years ago
- A few things we've encountered during our Spark-based ETL project☆24Mar 22, 2018Updated 8 years ago
- A library for querying Druid data sources with Apache Spark☆23Oct 28, 2020Updated 5 years ago
- A series of DAGs/Workflows to help maintain the operation of Airflow☆1,768Jun 18, 2024Updated last year
- A plugin for Airflow that creates and manages your DAGs with a web UI.☆20Nov 28, 2017Updated 8 years ago
- Hadoop InputFormat for http://druid.io/☆10Oct 26, 2016Updated 9 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆259Jul 19, 2023Updated 2 years ago
- Python library for sending data to Honeycomb☆20Nov 25, 2024Updated last year
- Turbine: the bare metals that gets you Airflow☆381Oct 10, 2021Updated 4 years ago