gregbaker / spark-celeryLinks
Helper to allow Python Celery tasks to do work in a Spark job.
☆28Updated 3 years ago
Alternatives and similar repositories for spark-celery
Users that are interested in spark-celery are comparing it to the libraries listed below
Sorting:
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 6 years ago
- Python DB-API client for Presto☆240Updated 2 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- Example for an airflow plugin☆49Updated 9 years ago
- A Python MapReduce and HDFS API for Hadoop☆241Updated 3 weeks ago
- Docker build for Zeppelin, a web-based Spark notebook☆221Updated 6 years ago
- Python client for Spark Jobserver Rest API☆40Updated 5 years ago
- A collection of examples using flinks new python API☆251Updated 9 months ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 8 months ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Updated 5 years ago
- Code reference from my Qbox blog posts.☆87Updated 10 years ago
- Flask app to push/pull on Kafka over HTTP☆41Updated 10 years ago
- Jupyter Notebook extension for Apache Spark integration☆191Updated 5 years ago
- REST-like API exposing Airflow data and operations☆61Updated 7 years ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 3 years ago
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆326Updated 5 years ago
- API and command line interface for HDFS☆276Updated last year
- Airflow script for incremental data import from Mysql to Hive using Sqoop.☆18Updated 7 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated last year
- Lightweight Azkaban client☆77Updated 6 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆234Updated 3 years ago
- Dockerized HDP Cluster☆84Updated 8 years ago
- A Python connector for Druid☆519Updated 4 months ago
- PySpark for Elastic Search☆55Updated 8 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 6 years ago
- Learn the pyspark API through pictures and simple examples☆170Updated 5 years ago
- Docker image for Apache Spark☆76Updated 6 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆97Updated 6 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 5 years ago
- pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4☆69Updated last year