surister / datasaurus
Data engineering framework written in Python, based on Polars.
☆14 · Updated 8 months ago
Alternatives and similar repositories for datasaurus:
Users who are interested in datasaurus are comparing it to the libraries listed below.
- Native polars deltalake reader ☆9 · Updated 4 months ago
- ☆10 · Updated last month
- Automated, schema-based JSON unpacking to Polars objects ☆13 · Updated 10 months ago
- Minimal plugin loading package for polars with optional typegen ☆12 · Updated 2 months ago
- Read Apache Arrow batches from ODBC data sources in Python ☆61 · Updated this week
- Integration of Pydantic with Kedro. ☆11 · Updated 5 months ago
- A curated list of polars projects and resources. ☆36 · Updated last year
- 🌎 Polars H3 Geospatial Plugin ☆40 · Updated last week
- Identifiers and Standard Format Parsing for Polars Dataframe ☆14 · Updated 5 months ago
- Time based splits for cross validation ☆34 · Updated 2 weeks ago
- Dask integration for Snowflake ☆30 · Updated 2 months ago
- Prefect integrations for working with Docker ☆43 · Updated 8 months ago
- Sentiment and language detection for text analytics. ☆16 · Updated 6 months ago
- A Databricks Plugin for Kedro ☆14 · Updated 2 weeks ago
- Polars plugin for stable hashing functionality ☆58 · Updated last month
- A library to use `modal` as a backend for `joblib`. ☆22 · Updated this week
- A repository of runnable examples using ibis ☆42 · Updated 6 months ago
- Python bindings and arrow integration for the rust object_store crate. ☆61 · Updated 5 months ago
- Write your dbt models using Ibis ☆56 · Updated last week
- CLI for data platform ☆19 · Updated last year
- A serverless duckDB deployment at GCP ☆38 · Updated 2 years ago
- A data pipeline orchestration library for rapid iterative development with automatic cache invalidation allowing users to focus writing t… ☆30 · Updated 2 weeks ago
- Automatically upgrade your Polars code to use the latest syntax available ☆62 · Updated 7 months ago
- Polars plugin for pairwise distance functions ☆58 · Updated last month
- ☆12 · Updated 2 months ago
- Cloud-agnostic Python API ☆61 · Updated 7 months ago
- Cache the intermediate results of queries on timeseries data in DataFusion. ☆18 · Updated 2 months ago
- Python package implementing transformers for pre processing steps for machine learning. ☆53 · Updated this week
- Rethinking machine learning pipelines ☆28 · Updated last month