ekzhu / josie
Code and Benchmarks for JOSIE (SIGMOD 2019)
☆19 Updated 2 years ago
Alternatives and similar repositories for josie
Users who are interested in josie are comparing it to the libraries listed below.
- Data-Centric What-If Analysis for Native Machine Learning Pipelines ☆16 Updated 2 years ago
- A Jupyter notebook extension to centralize and manage data ☆15 Updated 2 years ago
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching … ☆90 Updated 2 months ago
- A System for Optimized Semantic Computation ☆127 Updated this week
- ☆19 Updated 3 years ago
- ☆9 Updated last year
- Language Models as Multi-Modal Query Planners ☆13 Updated last year
- Inspect ML Pipelines in Python in the form of a DAG ☆70 Updated last year
- ☆79 Updated 2 years ago
- ☆60 Updated 2 months ago
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net ☆34 Updated last month
- Project overview and links to various resources ☆19 Updated 3 years ago
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors ☆51 Updated this week
- Graph Engine for Exploration and Search ☆42 Updated last year
- Balsa is a learned SQL query optimizer. It tailor optimizes your SQL queries to find the best execution plans for your hardware and engin… ☆141 Updated 3 years ago
- A Python-to-SQL transpiler as replacement for Python Pandas ☆48 Updated 2 years ago
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-… ☆21 Updated 2 years ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload ☆43 Updated 4 months ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24) ☆80 Updated 5 months ago
- Source code for several Metanome data profiling algorithms ☆57 Updated 2 years ago
- Implementation of DeepDB: Learn from Data, not from Queries! ☆102 Updated 2 years ago
- LSH index for approximate set containment search ☆58 Updated 3 years ago
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio… ☆41 Updated 2 years ago
- FDX, SIGMOD 2020 ☆19 Updated last year
- This repository provides data and scripts to use Sherlock, a DL-based model for semantic data type detection: https://sherlock.media.mit.… ☆168 Updated last year
- The BART Project: Benchmarking Algorithms for (data) Repairing and Translation ☆42 Updated last year
- SkinnerDB is an analytical database management system. It uses adaptive processing and reinforcement learning to find near-optimal join o… ☆49 Updated last year
- Pollock is a benchmark for data loading on character-delimited files. ☆20 Updated 4 months ago
- Paper list about adopting machine learning techniques into data management tasks. ☆37 Updated 5 years ago
- Apache datasketches ☆33 Updated 5 months ago