dask / distributed
A distributed task scheduler for Dask
☆1,612Updated this week
Alternatives and similar repositories for distributed:
Users that are interested in distributed are comparing it to the libraries listed below
- Scalable Machine Learning with Dask☆922Updated last month
- Extended pickling support for Python objects☆1,716Updated last week
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,032Updated last week
- A Python package to manage extremely large amounts of data☆1,324Updated 3 weeks ago
- Parallel computing with task scheduling☆13,045Updated this week
- Computing with Python functions.☆4,013Updated this week
- An implementation of chunked, compressed, N-dimensional arrays for Python.☆1,643Updated this week
- Real-time stream processing for python☆1,255Updated 4 months ago
- N-D labeled arrays and datasets in Python☆3,744Updated this week
- NumPy and Pandas interface to Big Data☆3,196Updated last year
- Dask tutorial☆1,849Updated last year
- serialize all of Python☆2,324Updated last week
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆893Updated 3 weeks ago
- Fast NumPy array functions written in C☆1,098Updated 5 months ago
- Robust and reusable Executor for joblib☆558Updated this week
- python implementation of the parquet columnar file format.☆817Updated 4 months ago
- A columnar data container that can be compressed.☆957Updated 2 years ago
- Cython implementation of Toolz: High performance functional utilities☆1,032Updated 2 months ago
- Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more☆2,282Updated this week
- A specification that python filesystems should adhere to.☆1,124Updated this week
- Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler☆643Updated last year
- Concurrent data pipelines in Python >>>☆1,573Updated last year
- Quilt is a data mesh for connecting people with actionable data☆1,331Updated this week
- A lightweight Traits like module☆634Updated last month
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,824Updated last year
- Ahead of Time compiler for numeric kernels☆2,025Updated this week
- Development tool to measure, monitor and analyze the memory behavior of Python objects in a running Python application.☆1,248Updated 8 months ago
- S3 Filesystem☆923Updated 2 weeks ago
- Fast Avro for Python☆659Updated this week
- Native Kubernetes integration for Dask☆316Updated last week