apache / datasketches-python
Apache datasketches
☆24Updated this week
Alternatives and similar repositories for datasketches-python:
Users that are interested in datasketches-python are comparing it to the libraries listed below
- A Python-to-SQL transpiler as replacement for Python Pandas☆48Updated 2 years ago
- ☆19Updated last year
- DuckDB is an in-process SQL OLAP Database Management System☆42Updated this week
- Ibis Substrait Compiler☆98Updated this week
- Apache Arrow Flight SQL adapter for PostgreSQL☆75Updated last month
- Train Gradient Boosting and Random Forest with only SQL (VLDB 2023)☆22Updated last year
- Core C++ Sketch Library☆229Updated this week
- Arrow, pydantic style☆84Updated 2 years ago
- ☆215Updated this week
- ☆68Updated last month
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Apache DataFusion Benchmarks☆16Updated 3 months ago
- Write your dbt models using Ibis☆59Updated last month
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆92Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆192Updated this week
- ☆87Updated 9 months ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- ☆67Updated last week
- Ibis analytics, with Ibis (and more!)☆20Updated 4 months ago
- ☆54Updated last year
- ☆34Updated last week
- Proof-of-concept extension combining the delta extension with Unity Catalog☆73Updated this week
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆56Updated 5 months ago
- An experimental Athena extension for DuckDB 🐤☆53Updated last month
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆86Updated last month
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆169Updated this week
- ✨ A Pydantic to PySpark schema library☆69Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- DuckDB extension that adds support for SQL/PGQ and graph algorithms☆145Updated this week