ekzhu / josieLinks
Code and Benchmarks for JOSIE (SIGMOD 2019)
☆18Updated 2 years ago
Alternatives and similar repositories for josie
Users that are interested in josie are comparing it to the libraries listed below
Sorting:
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated 2 years ago
- ☆60Updated 5 months ago
- LSH index for approximate set containment search☆60Updated 3 years ago
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net☆36Updated 4 months ago
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆97Updated last month
- A System for Optimized Semantic Computation☆164Updated last week
- Source code for several Metanome data profiling algorithms☆59Updated 2 years ago
- A Jupyter notebook extension to centralize and manage data☆15Updated 2 years ago
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors☆60Updated 2 months ago
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆41Updated 2 years ago
- Language Models as Multi-Modal Query Planners☆16Updated last year
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- Graph Engine for Exploration and Search☆42Updated last year
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆21Updated 2 years ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆90Updated 8 months ago
- ☆20Updated 3 years ago
- Foundation Models for Data Tasks☆110Updated 2 years ago
- ☆27Updated 3 years ago
- Balsa is a learned SQL query optimizer. It tailor optimizes your SQL queries to find the best execution plans for your hardware and engin…☆143Updated 3 years ago
- Apache datasketches☆37Updated 3 months ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆14Updated last year
- This repository provides data and scripts to use Sherlock, a DL-based model for semantic data type detection: https://sherlock.media.mit.…☆176Updated last year
- Characterization of relational table embeddings (VLDB 2024).☆32Updated last year
- Your worst case is our best case.☆143Updated 8 years ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆49Updated last year
- Labelled Subgraph Query Benchmark – A lightweight benchmark suite focusing on subgraph matching queries. Note: This is a microbenchmark f…☆35Updated 6 months ago
- DuckDB is an in-process SQL OLAP Database Management System☆47Updated 3 weeks ago
- FDX, SIGMOD 2020☆19Updated last year
- A python tool using XGboost and sentence-transformers to perform schema matching task on tables.☆37Updated 9 months ago
- Implementation of DeepDB: Learn from Data, not from Queries!☆103Updated 2 years ago