spiside / docker-luigi
A project to help develop Luigi pipelines using Docker ✳️
☆78 · Updated 3 years ago
Related projects:
- 🐍 🐳 Luigi in Docker - Alpine and Ubuntu images available ☆50 · Updated 3 years ago
- A Luigi-powered analytics / warehouse stack ☆87 · Updated 7 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 · Updated 5 years ago
- Send summary messages of your Luigi jobs to Slack ☆46 · Updated 5 years ago
- T4 is now in production as Quilt 3 ☆64 · Updated 5 years ago
- Slack notifications for the Luigi workflow manager ☆46 · Updated 3 years ago
- Start a cluster in EC2 for dask.distributed ☆106 · Updated 3 years ago
- Functional Airflow DAG definitions. ☆38 · Updated 7 years ago
- Sample repo for Luigi tasks & config ☆36 · Updated 8 years ago
- ☕⛵WIP PySpark dependency management ☆22 · Updated 6 years ago
- Example for an Airflow plugin ☆49 · Updated 8 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 · Updated 11 months ago
- Open source Flotilla ☆192 · Updated last week
- A Getting Started Guide for developing and using Airflow Plugins ☆94 · Updated 5 years ago
- Collection of Dask example notebooks ☆57 · Updated 6 years ago
- Required packages for using pandas in AWS Lambda functions ☆45 · Updated 8 years ago
- ☆54 · Updated 5 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow. ☆96 · Updated 3 years ago
- REST-like API exposing Airflow data and operations ☆61 · Updated 5 years ago
- Quickly get a Kubernetes executor Airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst… ☆36 · Updated 4 years ago
- Unit and integration testing with PySpark can be tough to figure out, let's make that easier. ☆22 · Updated 8 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆85 · Updated 8 years ago
- Docker container to make running Luigi tasks real easy. ☆11 · Updated 8 years ago
- A short guide for transitioning from Python to Scala ☆65 · Updated 8 years ago
- Read and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO. ☆24 · Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code ☆89 · Updated 3 years ago
- Public repository for versioning machine learning data ☆43 · Updated 2 years ago
- Slides produced by Engineers and Data Scientists of Blue Yonder ☆51 · Updated 4 years ago
- An external PySpark module that works like R's read.csv or pandas' read_csv, with automatic type inference and null value handling. Parse… ☆90 · Updated 8 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆53 · Updated 6 years ago