modin-project / unidistLinks
Unified Distributed Execution
β54Updated 8 months ago
Alternatives and similar repositories for unidist
Users that are interested in unidist are comparing it to the libraries listed below
Sorting:
- RFC document, tooling and other content related to the dataframe API standardβ110Updated last year
- Convert monolithic Jupyter notebooks π into maintainable Ploomber pipelines. πβ79Updated 9 months ago
- A Python package that parses sql and converts it to ibis expressionsβ54Updated last year
- A toolbox π§° for Jupyter notebooks π: testing, experiment tracking, debugging, profiling, and more!β67Updated 9 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withouβ¦β113Updated last year
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooksβ21Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.β110Updated 6 months ago
- Python binding for DataFusionβ59Updated 2 years ago
- β99Updated 2 weeks ago
- A repository of runnable examples using ibisβ44Updated last year
- A place to provide Coiled feedbackβ20Updated 4 months ago
- Coming soonβ61Updated last year
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.β24Updated last year
- ipywidgets library for drawing directed acyclic graphs in jupyterlab using dagre-d3β85Updated 7 months ago
- A library to use `modal` as a backend for `joblib`.β29Updated 5 months ago
- β89Updated 5 months ago
- π― aimrocks πΈ β python & cython bindings for RocksDB. Batteries included! πβ32Updated 4 months ago
- Ibis Substrait Compilerβ103Updated this week
- A Delta Lake reader for Daskβ53Updated last week
- A robust DAG implementation for parallel executionβ71Updated last year
- An abstraction layer for parameter tuningβ35Updated 10 months ago
- Dask integration for Snowflakeβ30Updated 7 months ago
- Serverless Python with Rayβ57Updated 2 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...β143Updated this week
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.β50Updated 2 years ago
- Distributed XGBoost on Rayβ149Updated last year
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!β133Updated this week
- βοΈ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.β45Updated 4 months ago
- Arrow, pydantic styleβ84Updated 2 years ago
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trinoβ88Updated last month