pydiverse / pydiverse.pipedag
A data pipeline orchestration library for rapid iterative development with automatic cache invalidation allowing users to focus writing their tasks in pandas, polars, sqlalchemy, ibis, and alike.
☆30Updated this week
Alternatives and similar repositories for pydiverse.pipedag:
Users that are interested in pydiverse.pipedag are comparing it to the libraries listed below
- Extremely lightweight compatibility layer between pandas and Polars☆40Updated 11 months ago
- Coming soon☆61Updated last year
- Rethinking machine learning pipelines☆28Updated 4 months ago
- Automated, schema-based JSON unpacking to Polars objects☆13Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 3 months ago
- Sentiment and language detection for text analytics.☆17Updated 9 months ago
- Dask integration for Snowflake☆30Updated 4 months ago
- ☆37Updated last week
- Time based splits for cross validation☆38Updated last month
- Assessing whether data from database complies with reference information.☆42Updated 3 weeks ago
- Easy and flexible data contracts☆125Updated last month
- Polars plugin for stable hashing functionality☆69Updated 4 months ago
- Polars plugin for pairwise distance functions☆65Updated 3 weeks ago
- RFC document, tooling and other content related to the dataframe API standard☆106Updated last year
- Distributed Task Queue based Dask☆38Updated last year
- Cluster tools for running Dask on Databricks☆13Updated 10 months ago
- Fast approximate joins on string columns for polars dataframes.☆12Updated 5 months ago
- 🧑🏫 Practical guide to big data analysis, with Python☆21Updated 8 months ago
- A repository of runnable examples using ibis☆43Updated 8 months ago
- Minimal plugin loading package for polars with optional type stub generation☆16Updated last month
- Automatically upgrade your Polars code to use the latest syntax available☆63Updated 9 months ago
- An Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆15Updated 4 years ago
- Native polars deltalake reader☆9Updated 7 months ago
- ☆89Updated 2 months ago
- Hatch plugin for conda environments☆40Updated 10 months ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more!☆68Updated 6 months ago
- Just a super thin wrapper for Python tasks that form a flow.☆21Updated last month
- ☆13Updated 3 months ago
- Have UV deal with all your Jupyter deps.☆24Updated 6 months ago
- A curated list of polars projects and resources.☆37Updated 2 weeks ago