prrao87 / duckdb-studyLinks
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
☆35Updated 2 years ago
Alternatives and similar repositories for duckdb-study
Users that are interested in duckdb-study are comparing it to the libraries listed below
Sorting:
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- A repository of runnable examples using ibis☆46Updated last year
- Polars plugin offering eXtra stuff for DateTimes☆229Updated last month
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆60Updated 3 weeks ago
- Stream Processing using Polars☆32Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆234Updated 3 months ago
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆45Updated 2 years ago
- Templates for your Kedro projects.☆83Updated last week
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆120Updated 6 months ago
- Code and materials for Effective Polars book☆84Updated last year
- Cost Efficient Data Pipelines with DuckDB☆61Updated 8 months ago
- Code examples showing flow deployment to various types of infrastructure☆110Updated 3 years ago
- Lightning fast OLAP-style point queries on Pandas DataFrames.☆127Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last month
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆95Updated last year
- ☆116Updated last week
- A FastMCP tool to search and retrieve Polars API documentation.☆71Updated 8 months ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆30Updated 3 years ago
- Dask integration for Snowflake☆30Updated 5 months ago
- Fast approximate joins on string columns for polars dataframes.☆15Updated last month
- ☆22Updated 2 weeks ago
- ☆28Updated last year
- Cloud-agnostic Python API☆60Updated last year
- Read Delta tables without any Spark☆47Updated last year
- Demo on how to use Prefect with Docker☆27Updated 3 years ago
- Polars Time Series Extension☆33Updated 2 months ago
- Easy and flexible data contracts☆171Updated 2 weeks ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.☆135Updated 2 years ago