modin-project / unidist
Unified Distributed Execution
☆51 Updated 4 months ago
Alternatives and similar repositories for unidist:
Users who are interested in unidist compare it to the libraries listed below.
- RFC document, tooling and other content related to the dataframe API standard ☆106 Updated 11 months ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆23 Updated last year
- Ibis Substrait Compiler ☆99 Updated this week
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow. ☆45 Updated last week
- IbisML is a library for building scalable ML pipelines using Ibis. ☆104 Updated 2 months ago
- Python binding for DataFusion ☆59 Updated 2 years ago
- A place to provide Coiled feedback ☆17 Updated 2 weeks ago
- Serverless Python with Ray ☆55 Updated 2 years ago
- Distributed Task Queue based on Dask ☆38 Updated last year
- Ray provider for Apache Airflow ☆47 Updated last year
- ☆37 Updated last week
- An abstraction layer for parameter tuning ☆35 Updated 6 months ago
- Ray-based Apache Beam runner ☆43 Updated last year
- ipywidgets library for drawing directed acyclic graphs in jupyterlab using dagre-d3 ☆79 Updated 4 months ago
- Coming soon ☆60 Updated last year
- Dockerfile templates for creating RAPIDS Docker Images ☆74 Updated this week
- ☆89 Updated last month
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary ☆15 Updated 4 years ago
- A Delta Lake reader for Dask ☆49 Updated 5 months ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more... ☆139 Updated last month
- Extremely lightweight compatibility layer between pandas and Polars ☆40 Updated 10 months ago
- A library to use `modal` as a backend for `joblib`. ☆28 Updated 2 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated 11 months ago
- Arrow, pydantic style ☆82 Updated 2 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆195 Updated this week
- general functions for your data .pipe()-lines. ☆16 Updated last year
- JupyterLab Extension to easily share a link to a running server on Binder ☆53 Updated 9 months ago
- ☆44 Updated 7 months ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆78 Updated 6 months ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL. ☆20 Updated last year