pydiverse / pydiverse.pipedag
A data pipeline orchestration library for rapid iterative development with automatic cache invalidation, allowing users to focus on writing their tasks in pandas, polars, sqlalchemy, ibis, and the like.
★30 · Updated 2 weeks ago
Alternatives and similar repositories for pydiverse.pipedag:
Users who are interested in pydiverse.pipedag are comparing it to the libraries listed below.
- Extremely lightweight compatibility layer between pandas and Polars ★41 · Updated 11 months ago
- Sentiment and language detection for text analytics. ★17 · Updated 9 months ago
- Automated Jupyter notebook testing. ★41 · Updated last year
- RFC document, tooling and other content related to the dataframe API standard ★108 · Updated last year
- Automated, schema-based JSON unpacking to Polars objects ★13 · Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis. ★108 · Updated 4 months ago
- Python bindings and arrow integration for the rust object_store crate. ★64 · Updated 8 months ago
- Native polars deltalake reader ★9 · Updated 8 months ago
- A repository of runnable examples using ibis ★43 · Updated 9 months ago
- Cluster tools for running Dask on Databricks ★13 · Updated 10 months ago
- Fast approximate joins on string columns for polars dataframes. ★12 · Updated 6 months ago
- ★38 · Updated this week
- ★32 · Updated 11 months ago
- Minimal plugin loading package for polars with optional type stub generation ★16 · Updated 2 months ago
- Have UV deal with all your Jupyter deps. ★24 · Updated 7 months ago
- A curated list of polars projects and resources. ★37 · Updated last month
- A place to provide Coiled feedback ★18 · Updated last month
- Use pathlib syntax to easily work with Pandas series containing file paths. ★69 · Updated last year
- The DBT of ML, as Aligned describes data dependencies in ML systems and reduces technical data debt ★58 · Updated last month
- Kedro plugin to support running pipelines on Dagster ★11 · Updated this week
- A toolbox 🧰 for Jupyter notebooks: testing, experiment tracking, debugging, profiling, and more! ★68 · Updated 7 months ago
- Arrow, pydantic style ★82 · Updated 2 years ago
- A declarative, 🐻‍❄️-native data frame validation library. ★135 · Updated this week
- ★14 · Updated 4 months ago
- ★89 · Updated 3 months ago
- s3pathlib is a Python package that provides a Pythonic object-oriented programming (OOP) interface to manipulate AWS S3 object / direct… ★30 · Updated 2 years ago
- Assessing whether data from database complies with reference information. ★42 · Updated this week
- Copier template for Python projects managed by uv. ★87 · Updated 2 weeks ago
- A hatch plugin to help build Jupyter packages ★45 · Updated 10 months ago
- asyncio bridge to the duckdb library ★41 · Updated 2 years ago