suhailrehman / fuzzydata
Fuzzy Data Benchmark
☆17Updated 11 months ago
Alternatives and similar repositories for fuzzydata:
Users that are interested in fuzzydata are comparing it to the libraries listed below
- Ibis Substrait Compiler☆98Updated this week
- RFC document, tooling and other content related to the dataframe API standard☆105Updated 9 months ago
- A Python-to-SQL transpiler as replacement for Python Pandas☆48Updated 2 years ago
- Train Gradient Boosting and Random Forest with only SQL (VLDB 2023)☆21Updated last year
- Apache Arrow Cookbook☆98Updated last month
- Unified Distributed Execution☆51Updated 2 months ago
- Apache Arrow PostgreSQL connector☆57Updated 11 months ago
- ☆14Updated 2 months ago
- ☆30Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆173Updated this week
- ☆89Updated this week
- ☆20Updated last year
- Distributed SQL Query Engine in Python using Ray☆241Updated 3 months ago
- A software engineering framework to jump start your machine learning projects☆37Updated 7 months ago
- Coming soon☆59Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆96Updated 3 weeks ago
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated 10 months ago
- reproducible benchmark of database-like ops☆152Updated 2 months ago
- Arrow, pydantic style☆84Updated 2 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated last year
- The SQL Standards Project aims to create consensus in SQL semantics☆45Updated 3 months ago
- A Delta Lake reader for Dask☆48Updated 3 months ago
- Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshif…☆48Updated 4 months ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆15Updated last year
- Python binding for DataFusion☆59Updated 2 years ago
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆18Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆27Updated 2 years ago
- Run Numba compiled functions in SQLite☆38Updated this week