SayreBlades / dask-ecs
An opinionated template for spinning up a dask cluster based on docker.
☆14Updated 7 years ago
Alternatives and similar repositories for dask-ecs:
Users that are interested in dask-ecs are comparing it to the libraries listed below
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- Minimal docker image for running Luigi☆10Updated 8 years ago
- python parallel map on kubernetes☆34Updated 7 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 6 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- KnowledgeRepo + JupyterLab☆48Updated 4 months ago
- Example Scala/SBT event consumer for Amazon Kinesis☆22Updated 9 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 6 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 7 years ago
- Docker client that reads from yaml files☆22Updated 9 months ago
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- Programmatic Control Flow☆12Updated 7 years ago
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Apache Spark under Docker☆9Updated 8 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 7 months ago
- Puppet module to provision Airbnb's Airflow☆19Updated 2 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- Python module and CLI to package and upload python lambda functions to AWS Lambda☆25Updated 3 years ago
- Docker-izing Data Science Applications CodeLab for QCon AI 2018☆13Updated 6 years ago
- personal cheatsheets on various technologies☆25Updated 8 years ago
- Python SDK for working with Snowplow enriched events in Spark, AWS Lambda et al.☆21Updated 4 months ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- Configuration and definitions of Airflow for OpenTrials☆18Updated 7 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- T4 is now in production as Quilt 3☆64Updated 5 years ago
- Infrastructure code to run notebooks on some EC2 nodes☆10Updated 6 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames☆57Updated last year
- Library to convert OpenTSDB data to pandas datastructures☆15Updated 9 years ago
- Create an nteractive application with zero configuration☆35Updated last year