danielbeach / PolarsVsPySpark
Can Polars crunch 27 GB of data faster than PySpark?
☆13 · Updated 2 years ago
Alternatives and similar repositories for PolarsVsPySpark
Users that are interested in PolarsVsPySpark are comparing it to the libraries listed below
- Code and materials for Effective Polars book ☆84 · Updated last year
- This repository contains coding interviews that I have encountered in company interviews ☆12 · Updated 5 years ago
- A FastMCP tool to search and retrieve Polars API documentation. ☆71 · Updated 8 months ago
- A repository of runnable examples using ibis ☆46 · Updated last year
- Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide ☆300 · Updated last month
- csv and flat-file sniffer built in Rust. ☆45 · Updated 2 years ago
- Recipes for using Python's polars library ☆271 · Updated last year
- Data Analysis with Polars, Published by Packt ☆32 · Updated last year
- ☆31 · Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆120 · Updated 6 months ago
- Python package implementing ML feature engineering and pre-processing for polars or pandas dataframes. ☆88 · Updated this week
- Pandas Training © MetaSnake 2022, CC BY-NC ☆18 · Updated 3 years ago
- Fast and easy echarts with polars backend for wrangling and a simple API ☆33 · Updated last month
- 📚 A curated collection of marimo notebooks for education. ☆260 · Updated last week
- Turn SciKitLearn pipelines into SQL ☆109 · Updated this week
- ☆30 · Updated last year
- Fake Pandas / PySpark DataFrame creator ☆48 · Updated last year
- Polars Cookbook, Published by Packt ☆355 · Updated last month
- Book documentation of the Polars DataFrame library ☆191 · Updated 2 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆234 · Updated 3 months ago
- Project template for Polars Plugins ☆81 · Updated last month
- ☆22 · Updated 3 years ago
- Getting started with DuckDB, by Packt Publishing ☆69 · Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆51 · Updated 2 years ago
- Read Apache Arrow batches from ODBC data sources in Python ☆74 · Updated 2 weeks ago
- Sentiment and language detection for text analytics. ☆17 · Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 · Updated 2 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆60 · Updated last year
- Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies ☆35 · Updated 2 years ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆58 · Updated 3 years ago