prrao87 / duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
☆32Updated last year
Alternatives and similar repositories for duckdb-study:
Users that are interested in duckdb-study are comparing it to the libraries listed below
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last year
- A repository of runnable examples using ibis☆43Updated 10 months ago
- Fast approximate joins on string columns for polars dataframes.☆12Updated 6 months ago
- Polars Time Series Extension☆27Updated 3 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- An abstraction layer for parameter tuning☆35Updated 8 months ago
- rust-for-data☆45Updated last year
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆28Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 4 months ago
- Jupyter Cell / Line Magics for DuckDB☆48Updated 3 months ago
- Dask integration for Snowflake☆30Updated 5 months ago
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆58Updated this week
- Prefect integrations with SQLAlchemy.☆25Updated last year
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- A FastMCP tool to search and retrieve Polars API documentation.☆48Updated 2 weeks ago
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- ☆21Updated 8 months ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆76Updated 5 months ago
- Prefect integrations for working with Docker☆43Updated last year
- Fake Pandas / PySpark DataFrame creator☆46Updated last year
- Exploring some issues related to churn☆16Updated last year
- Demo repository to lambda-fy your dbt runs☆11Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Cost Efficient Data Pipelines with DuckDB☆52Updated 9 months ago
- Cloud-agnostic Python API☆60Updated 11 months ago
- Quick overview of duckdb, pandas and polars through a simple data pipeline.☆14Updated last year
- Stream Processing using Polars☆30Updated 2 years ago
- ☆90Updated last year
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆42Updated last year
- Fast window operations☆41Updated 11 months ago