A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator
☆73Sep 20, 2019Updated 6 years ago
Alternatives and similar repositories for airflow-spark-operator-plugin
Users that are interested in airflow-spark-operator-plugin are comparing it to the libraries listed below.
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆176Apr 13, 2026Updated last month
- A plugin for Apache Airflow that exposes REST endpoints for the Command Line Interfaces☆325Dec 16, 2020Updated 5 years ago
- A plugin to Apache Airflow to allow you to run Zip and UnZip commands as an Operator☆12Jul 26, 2023Updated 2 years ago
- Airflow plugin to transfer arbitrary files between operators☆79Oct 19, 2018Updated 7 years ago
- Ansible role to deploy and configure Airflow☆41Jun 4, 2026Updated last week
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Oct 17, 2017Updated 8 years ago
- Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js☆11Dec 6, 2015Updated 10 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Nov 18, 2019Updated 6 years ago
- Airflow configuration for Telemetry☆204May 29, 2026Updated last week
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆234Aug 24, 2022Updated 3 years ago
- A library for reading data from Amazon S3 with optimised listing via Amazon SQS, using Spark SQL Streaming (or Structured Streaming).☆18Apr 20, 2024Updated 2 years ago
- Spark SQS Amazon queue receiver☆24Nov 22, 2021Updated 4 years ago
- Experiments produced during an end of studies project (ETS, H2018)☆14Nov 21, 2018Updated 7 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Mar 16, 2016Updated 10 years ago
- presto's elasticsearch connector☆11Dec 7, 2016Updated 9 years ago
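- A productivity tool that combines task management with the Pomodoro Technique, helping users focus on carrying out today's tasks and planning tomorrow's☆12May 16, 2018Updated 8 years ago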
- A focused web crawler based on Playwright, RMQ, Kafka and Flink.☆14Feb 4, 2021Updated 5 years ago
- Kong Serverless Plugins - this plugin has been moved into https://github.com/Kong/kong, please open issues and PRs in that repo☆17Sep 1, 2021Updated 4 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Sep 17, 2015Updated 10 years ago
- Source code for SIMD benchmarks and experiments in Java☆32Jun 30, 2017Updated 8 years ago
- An enterprise-grade Swagger documentation management center implemented in Go☆11Mar 16, 2018Updated 8 years ago
- A HDFS-backed ContentsManager implementation for IPython☆23Nov 13, 2024Updated last year
- ETL best practices with airflow, with examples☆1,354Sep 25, 2024Updated last year
- Functional Airflow DAG definitions.☆38Jul 4, 2017Updated 8 years ago
- cli wrapper for Teradata data warehouse utilities (BTEQ,etc..)☆24Mar 5, 2012Updated 14 years ago
- Delta Lake Examples☆11Apr 24, 2020Updated 6 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Oct 30, 2018Updated 7 years ago
- A library for querying Druid data sources with Apache Spark☆23Oct 28, 2020Updated 5 years ago
- A series of DAGs/Workflows to help maintain the operation of Airflow☆1,772Jun 18, 2024Updated last year
- A face recognition system based on the beego framework☆11Dec 19, 2017Updated 8 years ago
- Hadoop InputFormat for http://druid.io/☆10Oct 26, 2016Updated 9 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆259Jul 19, 2023Updated 2 years ago
- Turbine: the bare metals that gets you Airflow☆380Oct 10, 2021Updated 4 years ago
- A collection of airflow sample workflows for data processing on aws☆12Dec 1, 2017Updated 8 years ago
- API gateway with Akka HTTP☆51Nov 27, 2020Updated 5 years ago
- Docker Apache Airflow☆3,807Mar 1, 2023Updated 3 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆349Jul 24, 2018Updated 7 years ago
- A big data web application to predict USA airline traffic delay with Python, Flask, Apache Spark, Kafka, MongoDB, ElasticSearch, d3.js, s…☆32Jul 25, 2022Updated 3 years ago