prrao87 / duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
☆35Updated 2 years ago
Alternatives and similar repositories for duckdb-study
Users who are interested in duckdb-study are comparing it to the repositories listed below
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- The DBT of ML: Aligned describes data dependencies in ML systems and reduces technical data debt☆60Updated this week
- Code and materials for Effective Polars book☆83Updated last year
- A repository of runnable examples using ibis☆46Updated last year
- Fast approximate joins on string columns for polars dataframes.☆15Updated 2 weeks ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆119Updated 5 months ago
- ☆23Updated last year
- Cloud-agnostic Python API☆61Updated last year
- Stream Processing using Polars☆32Updated 2 years ago
- Polars plugin offering eXtra stuff for DateTimes☆229Updated last month
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆94Updated last year
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆45Updated 2 years ago
- Investigation for PyDataLondon 2023 and ODSC 2023 conference comparing Pandas 2, Polars and Dask☆11Updated 2 years ago
- Templates for your Kedro projects.☆82Updated this week
- Polars Time Series Extension☆32Updated 2 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- A demo of the Mito Streamlit Spreadsheet☆18Updated 2 years ago
- A FastMCP tool to search and retrieve Polars API documentation.☆71Updated 7 months ago
- Quick overview of duckdb, pandas and polars through a simple data pipeline.☆13Updated 2 years ago
- ☆15Updated last year
- Polars extension for fzf-style fuzzy matching☆33Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last month
- Possibly the fastest DataFrame-agnostic quality check library in town.☆233Updated 2 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆86Updated last year
- Time based splits for cross validation☆39Updated last week
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- Demo on how to use Prefect with Docker☆27Updated 3 years ago
- A FastAPI CLI & Streamlit App wrapper for Excel files... create APIs from Excel data files within seconds☆72Updated 2 years ago
- Lightning fast OLAP-style point queries on Pandas DataFrames.☆127Updated last year
- dagster scikit-learn pipeline example.☆46Updated 2 years ago