modin-project / unidist
Unified Distributed Execution
☆51 Updated 6 months ago
Alternatives and similar repositories for unidist:
Users that are interested in unidist are comparing it to the libraries listed below
- Python binding for DataFusion ☆59 Updated 2 years ago
- RFC document, tooling and other content related to the dataframe API standard ☆108 Updated last year
- Ibis Substrait Compiler ☆102 Updated this week
- An abstraction layer for parameter tuning ☆35 Updated 7 months ago
- Coming soon ☆61 Updated last year
- A place to provide Coiled feedback ☆18 Updated last month
- Arrow, pydantic style ☆82 Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆108 Updated 3 months ago
- ☆89 Updated 3 months ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆24 Updated last year
- Distributed Task Queue based Dask ☆38 Updated last year
- Ray-based Apache Beam runner ☆43 Updated last year
- ☆38 Updated this week
- An Aspiring Drop-In Replacement for Pandas at Scale ☆75 Updated 3 years ago
- A library to use `modal` as a backend for `joblib`. ☆28 Updated 3 months ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow. ☆45 Updated last month
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- A repository of runnable examples using ibis ☆43 Updated 9 months ago
- Extremely lightweight compatibility layer between pandas and Polars ☆41 Updated 11 months ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆78 Updated 7 months ago
- Dockerfile templates for creating RAPIDS Docker Images ☆77 Updated last week
- ☆89 Updated 3 weeks ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL. ☆20 Updated last year
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary ☆15 Updated 5 years ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps. ☆78 Updated 9 months ago
- Data and tooling to compare the API surfaces of various array libraries. ☆54 Updated 2 months ago
- A Delta Lake reader for Dask ☆49 Updated 6 months ago
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code. ☆102 Updated this week
- ipywidgets library for drawing directed acyclic graphs in jupyterlab using dagre-d3 ☆81 Updated 5 months ago
- Fuzzy Data Benchmark ☆17 Updated last year