apache / datasketches-python
Apache datasketches
☆22Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for datasketches-python
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆18Updated last year
- DuckDB extension that adds support for SQL/PGQ☆77Updated this week
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆85Updated 7 months ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆15Updated last year
- Core C++ Sketch Library☆225Updated 2 weeks ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated 4 months ago
- ☆19Updated 2 years ago
- Graph Engine for Exploration and Search☆40Updated 9 months ago
- Explaining Inference Queries with Bayesian Optimization☆10Updated 3 years ago
- ☆19Updated last year
- Apache Arrow PostgreSQL connector☆54Updated 9 months ago
- DuckDB is an in-process SQL OLAP Database Management System☆38Updated this week
- ☆11Updated last year
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆59Updated 4 months ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆45Updated 5 months ago
- Apache datasketches☆87Updated last year
- Ibis Substrait Compiler☆95Updated this week
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆42Updated 2 months ago
- A Python-to-SQL transpiler as replacement for Python Pandas☆47Updated last year
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆35Updated last year
- A polystore database from researchers of the Intel Science and Technology Center for Big Data☆37Updated 2 years ago
- A curated list of example code to collect data from Web APIs using DataPrep.Connector.☆34Updated last year
- ☆19Updated last year
- ☆28Updated this week
- Inspect ML Pipelines in Python in the form of a DAG☆68Updated 8 months ago
- Apache Arrow Flight SQL adapter for PostgreSQL☆69Updated 2 months ago
- ☆15Updated 2 years ago
- Website for DataSketches.☆95Updated this week
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.☆65Updated this week
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆66Updated last week