gregbaker / spark-celery
Helper to allow Python Celery tasks to do work in a Spark job.
☆27Updated 2 years ago
Alternatives and similar repositories for spark-celery:
Users that are interested in spark-celery are comparing it to the libraries listed below
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Code reference from my Qbox blog posts.☆87Updated 9 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- Example for an airflow plugin☆49Updated 8 years ago
- A tool and library for easily deploying applications on Apache YARN☆142Updated 11 months ago
- Python client for Spark Jobserver Rest API☆39Updated 4 years ago
- PySpark for Elastic Search☆55Updated 7 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆82Updated 4 years ago
- Simplify getting Zeppelin up and running☆56Updated 8 years ago
- Automated (Ansible) installation of HDP via Ambari Blueprint☆16Updated 7 years ago
- Flask app to push/pull on Kafka over HTTP☆41Updated 9 years ago
- pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4☆69Updated 4 months ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Helpers & syntactic sugar for PySpark.☆61Updated last year
- Docker compose files for various kafka stacks☆32Updated 6 years ago
- Docker images used internally by various Teradata projects for automation, testing, etc☆40Updated 7 years ago
- A simple examle for Python Kafka Avro☆86Updated 6 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 4 years ago
- Apache Drill Dialect for SQL Alchemy☆54Updated 7 months ago
- python library for interacting with SolrCloud☆36Updated 4 years ago
- A Python MapReduce and HDFS API for Hadoop☆238Updated 2 weeks ago
- Python DB-API client for Presto☆239Updated last year
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Updated 4 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- ☆41Updated 8 years ago
- Hadoop Cluster Configurations☆32Updated 3 years ago
- Example blueprint application for processing high-speed trading data.☆84Updated last year
- Support Highcharts in Apache Zeppelin☆81Updated 7 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago