dask-contrib / dask-databricks
Cluster tools for running Dask on Databricks
β13Updated 9 months ago
Alternatives and similar repositories for dask-databricks:
Users that are interested in dask-databricks are comparing it to the libraries listed below
- β89Updated last month
- π§βπ« Practical guide to big data analysis, with Pythonβ21Updated 8 months ago
- Material for Inside Dask talk | PyData DC | August 2021β13Updated 3 years ago
- Tools for making Prefect work better for typical data science workflowsβ19Updated 3 years ago
- A place to provide Coiled feedbackβ17Updated 2 weeks ago
- Streaming and approximate algorithms. WIP, use at own risk.β26Updated 2 months ago
- A pytest plugin for regression testing and regenerating Jupyter Notebooksβ51Updated last week
- Use pathlib syntax to easily work with Pandas series containing file paths.β69Updated last year
- "Hacking Dask" tutorial materialsβ71Updated 3 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...β139Updated last month
- Extension to hypothesis for testing numpy general universal functionsβ39Updated 3 years ago
- Automated Jupyter notebook testing. πβ41Updated last year
- Jupyter Server Proxy for Panelβ13Updated 2 years ago
- An Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediaryβ15Updated 4 years ago
- Application creator and general launcher for JupyterHubβ37Updated last week
- Bidirectional communication for the HoloViz ecosystemβ33Updated 2 months ago
- An extension to add Prefect flow visualizations into you Sphinx documentation.β13Updated 3 years ago
- β12Updated 3 months ago
- For general discussion and community planning. Discussion issues welcome.β22Updated 2 years ago
- Hatch plugin for conda environmentsβ40Updated 10 months ago
- JupyterLab dataset browser for THREDDS catalogβ25Updated 4 years ago
- Extremely lightweight compatibility layer between pandas and Polarsβ40Updated 10 months ago
- A multi-tenant server for securely deploying and managing Dask clusters.β137Updated 2 weeks ago
- π Documentation for Nebariβ15Updated this week
- Time based splits for cross validationβ37Updated 2 weeks ago
- β37Updated last week
- An abstraction layer for parameter tuningβ35Updated 6 months ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the sameβ¦β28Updated 2 years ago
- A library to use `modal` as a backend for `joblib`.β28Updated 2 months ago