pydiverse / pydiverse.pipedag
A data pipeline orchestration library for rapid iterative development with automatic cache invalidation allowing users to focus writing their tasks in pandas, polars, sqlalchemy, ibis, and alike.
☆30Updated this week
Alternatives and similar repositories for pydiverse.pipedag:
Users that are interested in pydiverse.pipedag are comparing it to the libraries listed below
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 3 months ago
- Assessing whether data from database complies with reference information.☆42Updated 3 weeks ago
- RFC document, tooling and other content related to the dataframe API standard☆106Updated last year
- Extremely lightweight compatibility layer between pandas and Polars☆40Updated 11 months ago
- Sentiment and language detection for text analytics.☆17Updated 8 months ago
- A repository of runnable examples using ibis☆43Updated 8 months ago
- Native polars deltalake reader☆9Updated 7 months ago
- ☆13Updated 3 months ago
- Cluster tools for running Dask on Databricks☆13Updated 9 months ago
- ☆37Updated this week
- Coming soon☆61Updated last year
- Arrow, pydantic style☆82Updated 2 years ago
- Automated, schema-based JSON unpacking to Polars objects☆13Updated last year
- Python bindings and arrow integration for the rust object_store crate.☆63Updated 7 months ago
- Kedro plugin to support running pipelines on Dagster☆10Updated this week
- ☆89Updated 2 months ago
- Polars plugin for stable hashing functionality☆69Updated 4 months ago
- Python package implementing transformers for pre processing steps for machine learning.☆56Updated this week
- Time based splits for cross validation☆38Updated last month
- Easy and flexible data contracts☆125Updated last month
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower☆15Updated last week
- Minimal plugin loading package for polars with optional type stub generation☆16Updated last month
- Automatically upgrade your Polars code to use the latest syntax available☆63Updated 9 months ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more!☆68Updated 6 months ago
- Identifiers and Standard Format Parsing for Polars Dataframe☆15Updated last month
- Marshmallow Schema generator for Pandas DataFrames☆24Updated 4 years ago
- Flat files, flat land.☆26Updated this week
- Polars Time Series Extension☆26Updated last month
- Provide fine-grained push access to GitHub from a JupyterHub☆26Updated 3 weeks ago
- 🧑🏫 Practical guide to big data analysis, with Python☆21Updated 8 months ago