dask / distributedLinks
A distributed task scheduler for Dask
☆1,664Updated this week
Alternatives and similar repositories for distributed
Users that are interested in distributed are comparing it to the libraries listed below
Sorting:
- Scalable Machine Learning with Dask☆944Updated 4 months ago
- Extended pickling support for Python objects☆1,881Updated 3 months ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,066Updated 2 months ago
- python implementation of the parquet columnar file format.☆881Updated last month
- A Python package to manage extremely large amounts of data☆1,359Updated this week
- Real-time stream processing for python☆1,291Updated last week
- serialize all of Python☆2,426Updated 2 weeks ago
- Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more☆2,397Updated 2 months ago
- parallel graph management and execution in heterogeneous computing☆1,474Updated 2 weeks ago
- Computing with Python functions.☆4,321Updated 3 weeks ago
- An implementation of chunked, compressed, N-dimensional arrays for Python.☆1,908Updated this week
- Robust and reusable Executor for joblib☆606Updated 5 months ago
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆991Updated this week
- NumPy and Pandas interface to Big Data☆3,198Updated 2 years ago
- Dask tutorial☆1,856Updated 3 months ago
- Fast NumPy array functions written in C☆1,160Updated 2 weeks ago
- better multiprocessing and multithreading in Python☆691Updated 2 weeks ago
- Parallel computing with task scheduling☆13,727Updated this week
- N-D labeled arrays and datasets in Python☆4,077Updated last week
- Cython implementation of Toolz: High performance functional utilities☆1,101Updated 2 months ago
- A lightweight Traits like module☆648Updated 3 months ago
- An in-browser Python profile viewer☆2,548Updated last year
- S3 Filesystem☆1,002Updated this week
- HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.☆2,200Updated this week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆652Updated this week
- Native Kubernetes integration for Dask☆324Updated 3 weeks ago
- SCOOP (Scalable COncurrent Operations in Python)☆656Updated 2 years ago
- Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler☆642Updated 2 years ago
- Quickly and accurately render even the largest data.☆3,506Updated this week
- With Holoviews, your data visualizes itself.☆2,877Updated this week