surister / datasaurus
Data Engineering framework written in Python based in Polars.
ā14Updated 6 months ago
Related projects ā
Alternatives and complementary repositories for datasaurus
- Read Apache Arrow batches from ODBC data sources in Pythonā57Updated 2 weeks ago
- Poetry plugin for creating docker images. šā18Updated last week
- CLI for data platformā19Updated 11 months ago
- Linear regression in SQL using dbtā65Updated last month
- Time based splits for cross validationā31Updated this week
- Python package implementing transformers for pre processing steps for machine learning.ā40Updated this week
- A serverless duckDB deployment at GCPā35Updated 2 years ago
- A repository of runnable examples using ibisā40Updated 4 months ago
- Minimal plugin loading package for polars with optional typegenā13Updated last week
- IbisML is a library for building scalable ML pipelines using Ibis.ā93Updated last month
- Sentiment and language detection for text analytics.ā16Updated 4 months ago
- An experimental Athena extension for DuckDB š¤ā49Updated 8 months ago
- Dask integration for Snowflakeā30Updated 4 months ago
- rust-for-dataā43Updated last year
- Write your dbt models using Ibisā52Updated 3 weeks ago
- Identifiers and Standard Format Parsing for Polars Dataframeā14Updated 3 months ago
- Automated, schema-based JSON unpacking to Polars objectsā13Updated 7 months ago
- A monorepo of many Rill example projectsā31Updated this week
- Project template for Polars Pluginsā63Updated last month
- Python bindings and arrow integration for the rust object_store crate.ā56Updated 3 months ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, Sā¦ā12Updated 3 months ago
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debtā52Updated 2 weeks ago
- Prefect integrations for working with Dockerā43Updated 6 months ago
- Fake Pandas / PySpark DataFrame creatorā43Updated 8 months ago
- Coming soonā58Updated last year
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)ā35Updated 2 years ago
- Run mssql scripts from Python.ā14Updated last week
- Jupyter Cell / Line Magics for DuckDBā38Updated last month
- Data-aware orchestration with dagster, dbt, and airbyteā30Updated last year