CoorpAcademy / docker-pyspark
Docker image of Apache Spark with its Python interface, pyspark.
☆40 · Updated 6 years ago
Related projects:
- Use Airflow to move data from multiple MySQL databases to BigQuery ☆99 · Updated 4 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow. ☆96 · Updated 3 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆85 · Updated 8 years ago
- REST-like API exposing Airflow data and operations ☆61 · Updated 5 years ago
- Example for an airflow plugin ☆49 · Updated 8 years ago
- A luigi powered analytics / warehouse stack ☆87 · Updated 7 years ago
- Airflow workflow management platform chef cookbook. ☆67 · Updated 5 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆146 · Updated 7 years ago
- A Getting Started Guide for developing and using Airflow Plugins ☆94 · Updated 5 years ago
- scaffold of Apache Airflow executing Docker containers ☆85 · Updated last year
- pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4 ☆69 · Updated last year
- Airflow training for the crunch conf ☆105 · Updated 5 years ago
- Just a boilerplate for PySpark and Flask ☆35 · Updated 6 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆192 · Updated 3 months ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆171 · Updated 10 months ago
- Send summary messages of your Luigi jobs to Slack ☆46 · Updated 5 years ago
- Code Repository for the EVO-ODAS ☆31 · Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR ☆53 · Updated 7 years ago
- fast and scalable Airflow on Kubernetes Setup. ☆28 · Updated last year
- This is a simple streaming application that utilises Kafka and Python ☆45 · Updated 5 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 · Updated 11 months ago
- ☆28 · Updated 3 years ago
- Load data from redshift into a pandas DataFrame and vice versa. ☆138 · Updated last year
- Updated repository ☆157 · Updated 2 years ago
- ☆39 · Updated this week
- Conversion utility from Zeppelin notes to Jupyter notebooks. ☆44 · Updated 4 years ago
- A project to help develop Luigi pipelines using Docker ✳️ ☆78 · Updated 3 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator ☆75 · Updated 5 years ago
- ☆54 · Updated 5 years ago
- Learn the pyspark API through pictures and simple examples ☆168 · Updated 3 years ago