SayreBlades / dask-ecs
An opinionated template for spinning up a dask cluster based on docker.
☆14Updated 7 years ago
Alternatives and similar repositories for dask-ecs:
Users that are interested in dask-ecs are comparing it to the libraries listed below
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- python parallel map on kubernetes☆34Updated 7 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 8 years ago
- Library for building reproducible data pipelines to support experimentation☆20Updated 9 years ago
- Docker image for Apache Hive running on Tez☆7Updated 10 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 6 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- Example Scala/SBT event consumer for Amazon Kinesis☆22Updated 9 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 7 months ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- A collection of Scala graph libraries and adapters for graph databases.☆15Updated 8 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 6 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- Example notebooks using BlazingSQL with the RAPIDS AI ecoystem.☆15Updated 5 years ago
- This module provides methods to fetch data from OpenTSDB HTTP interface and convert them into Pandas Timeseries object.☆14Updated 8 years ago
- Apache Spark under Docker☆9Updated 8 years ago
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- ☆12Updated 8 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Updated 8 years ago
- T4 is now in production as Quilt 3☆64Updated 5 years ago
- A library of machine learning algorithms implemented using principles of functional programming.☆23Updated 8 years ago
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- KnowledgeRepo + JupyterLab☆48Updated 5 months ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Notebooks which will provide a demo of Qgrid functionality☆20Updated 5 years ago
- Tiny Dask Docker images based on Alpine Linux☆17Updated 5 years ago
- Analysis pipeline for quick ML analyses.☆11Updated 6 years ago
- Small Docker image with Python Machine Learning tools (~180MB) https://hub.docker.com/r/frolvlad/alpine-python-machinelearning/☆80Updated 3 weeks ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago