pysysops / docker-luigidLinks
Luigi Central Scheduler Server on Docker
☆11Updated 9 years ago
Alternatives and similar repositories for docker-luigid
Users that are interested in docker-luigid are comparing it to the libraries listed below
Sorting:
- An example to illustrate using Luigi to manage a data science workflow in Greenplum Database☆12Updated 7 years ago
- Docker build for Apache Spark☆672Updated 4 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework☆84Updated 9 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆97Updated 6 years ago
- Apache Toree quickstart tutorial☆29Updated 9 years ago
- A collection of examples using flinks new python API☆251Updated 9 months ago
- Code reference from my Qbox blog posts.☆87Updated 10 years ago
- Docker build for Zeppelin, a web-based Spark notebook☆221Updated 6 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- Quickstart PySpark with Anaconda on AWS/EMR☆52Updated 9 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- Jupyter Notebook extension for Apache Spark integration☆191Updated 5 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated last year
- A simple examle for Python Kafka Avro☆86Updated 7 years ago
- Airflow script for incremental data import from Mysql to Hive using Sqoop.☆18Updated 7 years ago
- Docker image of Apache Spark with its Python interface, pyspark.☆40Updated 8 years ago
- PyAthenaJDBC is an Amazon Athena JDBC driver wrapper for the Python DB API 2.0 (PEP 249).☆94Updated 2 years ago
- Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1☆124Updated 10 years ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Updated 5 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆176Updated 8 months ago
- A Python MapReduce and HDFS API for Hadoop☆241Updated 3 weeks ago
- Jupyter kernel for scala and spark☆190Updated 2 years ago
- A Python implementation of Apache Kafka Streams☆311Updated 7 years ago
- ☆525Updated last month
- Send summary messages of your Luigi jobs to Slack☆46Updated 6 years ago
- PySpark for Elastic Search☆55Updated 8 years ago
- ☆54Updated 7 years ago
- A short guide for transitioning from Python to Scala☆65Updated 10 years ago
- Course materials for my data pipeline video course with O'Reilly☆201Updated 8 years ago
- A simple package that generates data for tests.☆81Updated 5 years ago