suhailrehman / fuzzydataLinks
Fuzzy Data Benchmark
☆17Updated last year
Alternatives and similar repositories for fuzzydata
Users that are interested in fuzzydata are comparing it to the libraries listed below
Sorting:
- Ibis Substrait Compiler☆105Updated last week
- A Python-to-SQL transpiler as replacement for Python Pandas☆49Updated 2 years ago
- RFC document, tooling and other content related to the dataframe API standard☆108Updated last year
- ☆53Updated 4 months ago
- Unified Distributed Execution☆57Updated last year
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆251Updated this week
- ☆34Updated 2 years ago
- ☆116Updated last month
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- A software engineering framework to jump start your machine learning projects☆37Updated last year
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆64Updated last year
- Dias: Dynamic Rewriting of Pandas Code☆79Updated 4 months ago
- reproducible benchmark of database-like ops☆176Updated 3 weeks ago
- The SQL Standards Project aims to create consensus in SQL semantics☆47Updated last year
- Arrow, pydantic style☆85Updated 2 years ago
- A playground for running duckdb as a stateless query engine over a data lake☆214Updated last year
- Train Gradient Boosting and Random Forest with only SQL (VLDB 2023)☆24Updated 2 years ago
- ☆80Updated 3 years ago
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.☆109Updated this week
- A work-in-progress book on Dask☆12Updated 2 years ago
- Apache Arrow PostgreSQL connector☆62Updated last year
- FlorDB 🌻☆155Updated last month
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆41Updated 2 years ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated 2 years ago
- Python bindings for sqlparser-rs☆199Updated 6 months ago
- Distributed SQL Engine in Python using Dask☆408Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆14Updated 9 months ago
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- A SQL parser☆62Updated last month