A distributed task scheduler for Dask
☆1,665 · Mar 19, 2026 · Updated this week
Alternatives and similar repositories for distributed
Users who are interested in distributed compare it to the libraries listed below.
- Parallel computing with task scheduling ☆13,765 · Mar 12, 2026 · Updated last week
- Scalable Machine Learning with Dask ☆945 · Sep 27, 2025 · Updated 5 months ago
- Deploy Dask on job schedulers like PBS, SLURM, and SGE ☆254 · Dec 19, 2025 · Updated 3 months ago
- Native Kubernetes integration for Dask ☆324 · Mar 2, 2026 · Updated 2 weeks ago
- A multi-tenant server for securely deploying and managing Dask clusters. ☆143 · Mar 2, 2026 · Updated 2 weeks ago
- Docker images for dask ☆244 · Feb 2, 2026 · Updated last month
- JupyterLab extension for Dask ☆328 · Jun 2, 2025 · Updated 9 months ago
- Dask tutorial ☆1,855 · Nov 4, 2025 · Updated 4 months ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure, and more... ☆146 · Oct 14, 2025 · Updated 5 months ago
- N-D labeled arrays and datasets in Python ☆4,113 · Mar 12, 2026 · Updated last week
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml ☆241 · Oct 13, 2018 · Updated 7 years ago
- Extended pickling support for Python objects ☆1,903 · Nov 5, 2025 · Updated 4 months ago
- An implementation of chunked, compressed, N-dimensional arrays for Python. ☆1,936 · Mar 13, 2026 · Updated last week
- Easy-to-run example notebooks for Dask ☆387 · Nov 26, 2025 · Updated 3 months ago
- kubernetes setup to bootstrap distributed on google container engine ☆66 · Jun 14, 2019 · Updated 6 years ago
- Disk-to-disk chunk transformation for chunked arrays. ☆175 · Mar 9, 2026 · Updated last week
- Computing with Python functions. ☆4,329 · Mar 3, 2026 · Updated 2 weeks ago
- Real-time stream processing for python ☆1,294 · Feb 24, 2026 · Updated 3 weeks ago
- Concurrent appendable key-value storage ☆107 · Jul 15, 2024 · Updated last year
- python implementation of the parquet columnar file format. ☆890 · Updated this week
- Useful Mutable Mappings ☆72 · Oct 31, 2023 · Updated 2 years ago
- ☆89 · Jan 21, 2025 · Updated last year
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆16,580 · Mar 13, 2026 · Updated last week
- NumPy aware dynamic Python compiler using LLVM ☆10,935 · Updated this week
- Start a cluster in EC2 for dask.distributed ☆105 · Nov 3, 2020 · Updated 5 years ago
- ☆76 · Jan 23, 2026 · Updated last month
- S3 Filesystem ☆1,016 · Mar 13, 2026 · Updated last week
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow ☆2,753 · Dec 8, 2025 · Updated 3 months ago
- A specification that python filesystems should adhere to. ☆1,288 · Feb 17, 2026 · Updated last month
- A Python package providing buffer compression and transformation codecs for use in data storage and communication applications. ☆141 · Mar 9, 2026 · Updated last week
- Collection of dask example notebooks ☆57 · Feb 14, 2018 · Updated 8 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,492 · Mar 1, 2026 · Updated 2 weeks ago
- A functional standard library for Python. ☆5,128 · Jan 1, 2026 · Updated 2 months ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,071 · Mar 9, 2026 · Updated last week
- cuDF - GPU DataFrame Library ☆9,558 · Updated this week
- Data Migration for the Blaze Project ☆1,004 · Jul 15, 2022 · Updated 3 years ago
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,363 · Feb 10, 2026 · Updated last month
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆41,773 · Mar 13, 2026 · Updated last week
- Experimental docker-compose setup to bootstrap distributed on a docker-swarm cluster. ☆92 · Jan 11, 2018 · Updated 8 years ago