dask-contrib / dask-databricks
Cluster tools for running Dask on Databricks
☆14 Updated 11 months ago
Alternatives and similar repositories for dask-databricks
Users who are interested in dask-databricks are comparing it to the libraries listed below.
- ☆88 Updated 4 months ago
- Tools for making Prefect work better for typical data science workflows ☆18 Updated 3 years ago
- "Hacking Dask" tutorial materials ☆71 Updated 3 years ago
- Material for Inside Dask talk | PyData DC | August 2021 ☆13 Updated 3 years ago
- Streaming and approximate algorithms. WIP, use at own risk. ☆26 Updated 5 months ago
- Make Polars DataFrames Generic Types ☆15 Updated last month
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary ☆15 Updated 5 years ago
- ☆38 Updated this week
- Fast approximate joins on string columns for polars dataframes. ☆12 Updated 7 months ago
- 🧑‍🏫 Practical guide to big data analysis, with Python ☆22 Updated 10 months ago
- A pytest plugin for regression testing and regenerating Jupyter Notebooks ☆53 Updated this week
- JupyterLab dataset browser for THREDDS catalog ☆25 Updated 4 years ago
- Automated Jupyter notebook testing. 📙 ☆41 Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more... ☆141 Updated last week
- Hatch plugin for conda environments ☆40 Updated last year
- Advanced algorithms for xarray ☆37 Updated 2 months ago
- An extension to add Prefect flow visualizations into your Sphinx documentation. ☆13 Updated 3 years ago
- For general discussion and community planning. Discussion issues welcome. ☆23 Updated 2 years ago
- Dask integration for Snowflake ☆30 Updated 6 months ago
- Use pathlib syntax to easily work with Pandas series containing file paths. ☆69 Updated 2 years ago
- Material for the Jupytext+Papermill blog post ☆31 Updated 4 years ago
- Extremely lightweight compatibility layer between pandas and Polars ☆41 Updated last year
- A place to provide Coiled feedback ☆19 Updated 2 months ago
- API between Parquet files and GeoDataFrames for fast input/output of GIS data. // This project was a proof of concept. For current develo… ☆25 Updated 5 years ago
- Python library for manipulating data split into a collection of groups stored in Zarr format. ☆13 Updated this week
- Schema validation for Xarray objects ☆42 Updated 2 months ago
- Glue JupyterLab Extension ☆18 Updated 3 weeks ago
- Tools to provide a control plane for managing the lifecycle of Dask clusters. ☆25 Updated last year
- An experiment to query Xarray datasets with SQL ☆32 Updated 9 months ago
- Jupyter Widgets library for OpenLayersJS ☆22 Updated 9 months ago