surister / datasaurus
Data Engineering framework written in Python based in Polars.
☆14Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for datasaurus
- Write your dbt models using Ibis☆53Updated last month
- CLI for data platform☆19Updated 11 months ago
- Dask integration for Snowflake☆30Updated last week
- Read Apache Arrow batches from ODBC data sources in Python☆58Updated this week
- Minimal plugin loading package for polars with optional typegen☆13Updated 3 weeks ago
- An experimental Athena extension for DuckDB 🐤☆50Updated 9 months ago
- Identifiers and Standard Format Parsing for Polars Dataframe☆15Updated 4 months ago
- Automated, schema-based JSON unpacking to Polars objects☆14Updated 8 months ago
- Sentiment and language detection for text analytics.☆16Updated 4 months ago
- A repository of runnable examples using ibis☆41Updated 4 months ago
- Prefect integrations for working with Docker☆43Updated 6 months ago
- Time based splits for cross validation☆33Updated last week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆48Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆95Updated last month
- Python bindings and arrow integration for the rust object_store crate.☆57Updated 3 months ago
- Coming soon☆58Updated last year
- Linear regression in SQL using dbt☆66Updated last month
- Rethinking machine learning pipelines☆26Updated this week
- ☆34Updated this week
- Plugin for Intake to read from SQL servers☆15Updated last year
- Makes it easy to use altair from FastHTML☆22Updated last month
- A dbt-Core package for generating models from an activity stream.☆39Updated 7 months ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 8 months ago
- Cloud-agnostic Python API☆60Updated 5 months ago
- A curated list of polars projects and resources.☆35Updated last year
- Alternative admin panel for CrateDB databases☆12Updated 3 months ago
- Cluster tools for running Dask on Databricks☆13Updated 5 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆37Updated this week