prrao87 / duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
☆27Updated last year
Related projects ⓘ
Alternatives and complementary repositories for duckdb-study
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago
- Investigation for PyDataLondon 2023 and ODSC 2023 conference comparing Pandas 2, Polars and Dask☆11Updated 11 months ago
- An abstraction layer for parameter tuning☆36Updated 2 months ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆28Updated 2 years ago
- Demo on how to use Prefect with Docker☆26Updated 2 years ago
- ☆43Updated 3 months ago
- Dask integration for Snowflake☆30Updated last week
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- PyCon Talks 2022 by Antoine Toubhans☆23Updated 2 years ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- A Modal that works with Panel in both server and notebook environments.☆20Updated 11 months ago
- A repository of runnable examples using ibis☆41Updated 4 months ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆21Updated 2 years ago
- Cloud-agnostic Python API☆60Updated 5 months ago
- ☆14Updated last year
- ☆20Updated 3 years ago
- Example of configuring multiplage apps via a custom config file☆18Updated last year
- dagster scikit-learn pipeline example.☆43Updated last year
- The easiest way to integrate Kedro and Great Expectations☆53Updated last year
- Jupyter Cell / Line Magics for DuckDB☆40Updated last week
- Stupidly simple Python Package to build front end dashboards within Python☆20Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆63Updated last month
- Time based splits for cross validation☆33Updated last week
- Python package for text mining of time-series data☆68Updated 2 months ago
- Demo repository to lambda-fy your dbt runs☆11Updated last year
- Exploring some issues related to churn☆17Updated 8 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆95Updated last month
- Automated Jupyter notebook testing. 📙☆41Updated 9 months ago
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆53Updated 3 weeks ago