prrao87 / duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
☆35Updated 2 years ago
Alternatives and similar repositories for duckdb-study
Users who are interested in duckdb-study are comparing it to the repositories listed below
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- The DBT of ML, as Aligned describes data dependencies in ML systems and reduces technical data debt☆60Updated this week
- A repository of runnable examples using ibis☆46Updated last year
- ☆28Updated last year
- Polars plugin offering eXtra stuff for DateTimes☆229Updated last month
- Stream Processing using Polars☆32Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆119Updated 5 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆233Updated 2 months ago
- Cloud-agnostic Python API☆61Updated last year
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- ☆116Updated last month
- Project template for Polars Plugins☆81Updated last month
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last month
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Templates for your Kedro projects.☆82Updated this week
- Fast approximate joins on string columns for polars dataframes.☆15Updated 2 weeks ago
- Lightning fast OLAP-style point queries on Pandas DataFrames.☆127Updated last year
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week
- Code and materials for Effective Polars book☆83Updated last year
- Jupyter Cell / Line Magics for DuckDB☆55Updated 3 months ago
- A FastMCP tool to search and retrieve Polars API documentation.☆71Updated 7 months ago
- Code examples showing flow deployment to various types of infrastructure☆110Updated 3 years ago
- Polars Time Series Extension☆32Updated 2 months ago
- Extremely lightweight compatibility layer between pandas and Polars☆41Updated last year
- Time based splits for cross validation☆39Updated last week
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆45Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Dask integration for Snowflake☆30Updated 5 months ago