dask / dask-tutorial
Dask tutorial
☆1,845Updated last year
Alternatives and similar repositories for dask-tutorial:
Users that are interested in dask-tutorial are comparing it to the libraries listed below
- Scalable Machine Learning with Dask☆922Updated last month
- Easy-to-run example notebooks for Dask☆376Updated last year
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,584Updated 11 months ago
- A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews☆1,175Updated this week
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆3,067Updated last year
- High-level tools to simplify visualization in Python.☆864Updated 3 months ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,354Updated 5 months ago
- A distributed task scheduler for Dask☆1,609Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,055Updated this week
- Tools for diffing and merging of Jupyter notebooks.☆2,713Updated 5 months ago
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,323Updated 3 weeks ago
- python implementation of the parquet columnar file format.☆814Updated 4 months ago
- 📚 Parameterize, execute, and analyze notebooks☆6,106Updated 2 months ago
- Data Analysis Baseline Library☆728Updated 2 months ago
- Describing statistical models in Python using symbolic formulas☆965Updated this week
- Parallel computing with task scheduling☆13,011Updated last week
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,032Updated this week
- N-D labeled arrays and datasets in Python☆3,735Updated this week
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,402Updated this week
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,496Updated 3 months ago
- Easy pipelines for pandas DataFrames.☆717Updated 4 months ago
- Parallel computing in Python tutorial materials☆304Updated 5 years ago
- Real-time stream processing for python☆1,257Updated 3 months ago
- With Holoviews, your data visualizes itself.☆2,761Updated this week
- NumPy and Pandas interface to Big Data☆3,195Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,766Updated 2 years ago
- Bokeh Plotting Backend for Pandas and GeoPandas☆882Updated 11 months ago
- bamboolib - a GUI for pandas DataFrames☆945Updated last year
- A Python package for manipulating 2-dimensional tabular data structures☆1,823Updated last week
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.☆516Updated 2 months ago