SayreBlades / dask-ecs
An opinionated template for spinning up a dask cluster based on docker.
☆14 Updated 7 years ago
Alternatives and similar repositories for dask-ecs
Users who are interested in dask-ecs are comparing it to the libraries listed below.
- Functional Airflow DAG definitions. ☆38 Updated 8 years ago
- python parallel map on kubernetes ☆34 Updated 8 years ago
- Task Orchestration Tool Based on SWF and boto3 ☆38 Updated 6 years ago
- An example project for doing grid search in MLlib ☆13 Updated 10 years ago
- Docker image for Apache Hive running on Tez ☆7 Updated 10 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆52 Updated 7 years ago
- Library for building reproducible data pipelines to support experimentation ☆20 Updated 9 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance ☆24 Updated 7 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 Updated last year
- Apache Spark AWS Lambda Executor (SAMBA) ☆44 Updated 7 years ago
- Dependency and data pipeline management framework for Spark and Scala ☆15 Updated 8 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data. ☆13 Updated 9 years ago
- Deploy dask-distributed on google container engine using kubernetes ☆40 Updated 6 years ago
- Example Scala/SBT event consumer for Amazon Kinesis ☆22 Updated 10 years ago
- An umbrella project for multiple implementations of model serving ☆45 Updated 7 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines. ☆14 Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 9 years ago
- Scala port of the word2vec toolkit. ☆11 Updated 8 years ago
- Augustus is an open source system for building and scoring statistical models designed to work with data sets that are too large to fit i… ☆43 Updated 11 years ago
- Reproducing Distributed Systems and Experiments on Cloud ☆40 Updated last year
- Spark-cloud is a set of scripts for starting spark clusters on ec2 ☆12 Updated 9 years ago
- A Giter8 template for scio ☆31 Updated 5 months ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL ☆42 Updated last week
- A scala library for IBM ILOG CPLEX ☆19 Updated 5 years ago
- Python SDK for working with Snowplow enriched events in Spark, AWS Lambda et al. ☆21 Updated 7 months ago
- A cookiecutter template for Apache Spark applications written in Scala ☆10 Updated 6 years ago
- Data-ish exploration through SQL+Uncertainty ☆27 Updated 2 years ago
- Boilerplate Project with Django Channels + React + Redux + WebSocket Middleware ☆8 Updated 8 years ago
- Utils around luigi. ☆66 Updated 4 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms. ☆42 Updated 12 years ago