apache / datasketches-python
Apache DataSketches
☆27 Updated 3 weeks ago
Alternatives and similar repositories for datasketches-python:
Users who are interested in datasketches-python compare it to the libraries listed below:
- ☆30 Updated last year
- Inspect ML Pipelines in Python in the form of a DAG ☆70 Updated last year
- DuckDB is an in-process SQL OLAP Database Management System ☆42 Updated last week
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp ☆86 Updated 2 months ago
- Apache Arrow Flight SQL adapter for PostgreSQL ☆78 Updated 2 months ago
- Ibis analytics, with Ibis (and more!) ☆21 Updated 6 months ago
- ☆37 Updated last week
- Ibis Substrait Compiler ☆100 Updated this week
- Graph Engine for Exploration and Search ☆40 Updated last year
- Core C++ Sketch Library ☆230 Updated last month
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- deferred computational framework for multi-engine pipelines ☆113 Updated this week
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif… ☆58 Updated 6 months ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of … ☆13 Updated 9 months ago
- Python binding for DataFusion ☆59 Updated 2 years ago
- Arrow, pydantic style ☆82 Updated 2 years ago
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives. ☆101 Updated this week
- DuckDB extension that adds support for SQL/PGQ and graph algorithms ☆173 Updated this week
- Train Gradient Boosting and Random Forest with only SQL (VLDB 2023) ☆22 Updated last year
- ☆238 Updated last week
- Convenient pyarrow operations following the Pandas API ☆44 Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆38 Updated 2 years ago
- An experimental Athena extension for DuckDB 🐤 ☆54 Updated 2 months ago
- Apache DataFusion Benchmarks ☆17 Updated 4 months ago
- Scalytics Connect development environment, pre-build ☆22 Updated last year
- Boring Data Tool ☆214 Updated last year
- ☆89 Updated 10 months ago
- A playground for running duckdb as a stateless query engine over a data lake ☆190 Updated last year
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method ☆13 Updated last year
- DuckDB extension for Delta Lake ☆172 Updated last week