A distributed task scheduler for Dask
☆1,667Updated this week
Alternatives and similar repositories for distributed
Users that are interested in distributed are comparing it to the libraries listed below
Sorting:
- Parallel computing with task scheduling☆13,746Updated this week
- Scalable Machine Learning with Dask☆945Sep 27, 2025Updated 5 months ago
- Native Kubernetes integration for Dask☆324Jan 13, 2026Updated last month
- Deploy Dask on job schedulers like PBS, SLURM, and SGE☆254Dec 19, 2025Updated 2 months ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆143Feb 17, 2026Updated last week
- Docker images for dask☆244Feb 2, 2026Updated 3 weeks ago
- Dask tutorial☆1,856Nov 4, 2025Updated 3 months ago
- JupyterLab extension for Dask☆328Jun 2, 2025Updated 8 months ago
- Extended pickling support for Python objects☆1,894Nov 5, 2025Updated 3 months ago
- N-D labeled arrays and datasets in Python☆4,096Updated this week
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆146Oct 14, 2025Updated 4 months ago
- An implementation of chunked, compressed, N-dimensional arrays for Python.☆1,919Feb 21, 2026Updated last week
- Easy-to-run example notebooks for Dask☆388Nov 26, 2025Updated 3 months ago
- Computing with Python functions.☆4,325Feb 6, 2026Updated 3 weeks ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆241Oct 13, 2018Updated 7 years ago
- Concurrent appendable key-value storage☆107Jul 15, 2024Updated last year
- python implementation of the parquet columnar file format.☆889Jan 6, 2026Updated last month
- Real-time stream processing for python☆1,293Updated this week
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,753Dec 8, 2025Updated 2 months ago
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,529Updated this week
- NumPy aware dynamic Python compiler using LLVM☆10,921Feb 20, 2026Updated last week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,362Feb 10, 2026Updated 2 weeks ago
- kubernetes setup to bootstrap distributed on google container engine☆66Jun 14, 2019Updated 6 years ago
- cuDF - GPU DataFrame Library☆9,498Updated this week
- S3 Filesystem☆1,009Feb 18, 2026Updated last week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,475Feb 5, 2026Updated 3 weeks ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,413Updated this week
- A functional standard library for Python.☆5,118Jan 1, 2026Updated last month
- NumPy and Pandas interface to Big Data☆3,197Sep 29, 2023Updated 2 years ago
- Useful Mutable Mappings☆72Oct 31, 2023Updated 2 years ago
- Disk-to-disk chunk transformation for chunked arrays.☆175Updated this week
- A specification that python filesystems should adhere to.☆1,285Feb 17, 2026Updated last week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆653Feb 4, 2026Updated 3 weeks ago
- ☆76Jan 23, 2026Updated last month
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,068Feb 10, 2026Updated 2 weeks ago
- JupyterLab computational environment.☆15,018Updated this week
- Data Migration for the Blaze Project☆1,005Jul 15, 2022Updated 3 years ago
- Declarative visualization library for Python☆10,268Updated this week
- the portable Python dataframe library☆6,404Feb 21, 2026Updated last week