surister / datasaurus
Data Engineering framework written in Python based in Polars.
☆14Updated 9 months ago
Alternatives and similar repositories for datasaurus:
Users that are interested in datasaurus are comparing it to the libraries listed below
- Minimal plugin loading package for polars with optional typegen☆13Updated 3 months ago
- ☆10Updated 2 months ago
- Python package implementing transformers for pre processing steps for machine learning.☆54Updated last week
- Native polars deltalake reader☆9Updated 5 months ago
- Sentiment and language detection for text analytics.☆16Updated 7 months ago
- Polars plugin for stable hashing functionality☆60Updated 2 months ago
- A repository of runnable examples using ibis☆42Updated 7 months ago
- Write your dbt models using Ibis☆59Updated last month
- Time based splits for cross validation☆35Updated 2 weeks ago
- ☆16Updated last year
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆52Updated this week
- Jupyter Cell / Line Magics for DuckDB☆45Updated last week
- Automatically upgrade your Polars code to use the latest syntax available☆62Updated 8 months ago
- Prefect integrations for working with Docker☆43Updated 9 months ago
- Linear regression in SQL using dbt☆68Updated last month
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- 🏁 A sweet and speedy code generator for dbt 🏎️✨☆25Updated 8 months ago
- Read Apache Arrow batches from ODBC data sources in Python☆63Updated this week
- Polars plugin for pairwise distance functions☆62Updated 2 months ago
- CLI for data platform☆19Updated last year
- ☆11Updated 3 months ago
- ☆26Updated 5 months ago
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering☆34Updated last year
- [Project moved] Polars integration for Dagster☆36Updated 11 months ago
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)☆35Updated 2 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆100Updated last month
- 🌎 Polars H3 Geospatial Plugin☆52Updated 2 weeks ago