suhailrehman / fuzzydataLinks
Fuzzy Data Benchmark
☆17Updated last year
Alternatives and similar repositories for fuzzydata
Users that are interested in fuzzydata are comparing it to the libraries listed below
Sorting:
- Ibis Substrait Compiler☆105Updated this week
- A Python-to-SQL transpiler as replacement for Python Pandas☆48Updated 2 years ago
- RFC document, tooling and other content related to the dataframe API standard☆108Updated last year
- Unified Distributed Execution☆56Updated 11 months ago
- Train Gradient Boosting and Random Forest with only SQL (VLDB 2023)☆25Updated last year
- ☆107Updated this week
- reproducible benchmark of database-like ops☆172Updated 3 months ago
- ☆34Updated 2 years ago
- Distributed SQL Query Engine in Python using Ray☆244Updated 11 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆240Updated this week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆302Updated last year
- Distributed SQL Engine in Python using Dask☆407Updated last year
- Arrow, pydantic style☆84Updated 2 years ago
- ☆48Updated 2 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆211Updated last year
- The SQL Standards Project aims to create consensus in SQL semantics☆47Updated 11 months ago
- Python binding for DataFusion☆59Updated 3 years ago
- Apache Arrow Cookbook☆103Updated 2 weeks ago
- DuckDB is an in-process SQL OLAP Database Management System☆44Updated 2 months ago
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.☆108Updated this week
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated 2 years ago
- reproducible benchmark of database-like ops☆340Updated 2 years ago
- Data pipelines from re-usable components☆107Updated 2 years ago
- Apache Arrow PostgreSQL connector☆62Updated last year
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated 2 years ago
- Coming soon☆62Updated last year
- ☆90Updated last year
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆221Updated this week
- A Python package that parses sql and converts it to ibis expressions☆55Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆115Updated last month