prrao87 / duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
☆34 Updated 2 years ago
Alternatives and similar repositories for duckdb-study
Users that are interested in duckdb-study are comparing it to the libraries listed below
- The DBT of ML: Aligned describes data dependencies in ML systems and reduces technical data debt ☆60 Updated 3 weeks ago
- Stream Processing using Polars ☆32 Updated 2 years ago
- A repository of runnable examples using ibis ☆46 Updated last year
- Polars plugin offering eXtra stuff for DateTimes ☆224 Updated last week
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 Updated last year
- Cloud-agnostic Python API ☆60 Updated last year
- Lightning fast OLAP-style point queries on Pandas DataFrames. ☆124 Updated 11 months ago
- Fast approximate joins on string columns for polars dataframes. ☆14 Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis. ☆117 Updated 3 months ago
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy ☆45 Updated 2 years ago
- Code and materials for Effective Polars book ☆83 Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro ☆89 Updated 10 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 Updated 2 years ago
- Polars Time Series Extension ☆32 Updated 8 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆223 Updated this week
- Jupyter Cell / Line Magics for DuckDB ☆54 Updated 3 weeks ago
- Code examples showing flow deployment to various types of infrastructure ☆110 Updated 2 years ago
- ☆28 Updated last year
- Polars extension for fzf-style fuzzy matching ☆30 Updated last year
- ☆115 Updated 3 weeks ago
- Templates for your Kedro projects. ☆79 Updated last week
- Cost Efficient Data Pipelines with DuckDB ☆58 Updated 5 months ago
- Project template for Polars Plugins ☆79 Updated last week
- Dask integration for Snowflake ☆30 Updated 2 months ago
- Time based splits for cross validation ☆39 Updated 3 weeks ago
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- ☆12 Updated 2 years ago
- Easy and flexible data contracts ☆165 Updated 2 weeks ago
- Read Delta tables without any Spark ☆47 Updated last year
- ☆23 Updated last year