prrao87 / duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
☆31 Updated last year
Alternatives and similar repositories for duckdb-study:
Users who are interested in duckdb-study are comparing it to the libraries listed below.
- Example project for building scalable data pipelines with Kedro and Ibis. ☆12 Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro ☆69 Updated 2 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆112 Updated 10 months ago
- A repository of runnable examples using ibis ☆42 Updated 7 months ago
- An abstraction layer for parameter tuning ☆35 Updated 5 months ago
- Cloud-agnostic Python API ☆61 Updated 8 months ago
- Dask integration for Snowflake ☆30 Updated 3 months ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆100 Updated last month
- Comparing Polars to Pandas and a small introduction ☆43 Updated 3 years ago
- Time based splits for cross validation ☆35 Updated 2 weeks ago
- Demo on how to use Prefect with Docker ☆25 Updated 2 years ago
- ☆30 Updated last year
- Extremely lightweight compatibility layer between pandas and Polars ☆39 Updated 9 months ago
- Polars plugin for pairwise distance functions ☆62 Updated 2 months ago
- Read Delta tables without any Spark ☆47 Updated 11 months ago
- Polars plugin offering eXtra stuff for DateTimes ☆197 Updated 2 months ago
- Jupyter Cell / Line Magics for DuckDB ☆45 Updated last week
- Investigation for PyDataLondon 2023 and ODSC 2023 conference comparing Pandas 2, Polars and Dask ☆11 Updated last year
- Unified Distributed Execution ☆51 Updated 3 months ago
- Code and materials for Effective Polars book ☆73 Updated 10 months ago
- ☆84 Updated last month
- Coming soon ☆59 Updated last year
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 Updated last year
- Python package implementing transformers for pre processing steps for machine learning. ☆54 Updated last week
- A FastAPI CLI & Streamlit App wrapper for Excel files... create APIs from Excel data files within seconds ☆70 Updated last year
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 5 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆49 Updated last year
- Sentiment and language detection for text analytics. ☆16 Updated 7 months ago