juneau-project / juneau
A Jupyter notebook extension to centralize and manage data
☆14Updated 2 years ago
Alternatives and similar repositories for juneau
Users that are interested in juneau are comparing it to the libraries listed below
Sorting:
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆13Updated last year
- ☆27Updated 6 years ago
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆18Updated 2 years ago
- Project overview and links to various resources☆19Updated 3 years ago
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆88Updated last month
- D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf☆20Updated 3 years ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆44Updated 3 years ago
- Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index☆42Updated last year
- Code and data for Sato https://arxiv.org/abs/1911.06311.☆112Updated last year
- Explaining Inference Queries with Bayesian Optimization☆10Updated 4 years ago
- Graph Engine for Exploration and Search☆40Updated last year
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆20Updated last year
- ☆11Updated last year
- Linked SPARQL Queries (LSQ): Framework for RDFizing triple store (web) logs and performing SPARQL query extraction, analysis and benchmar…☆26Updated 6 months ago
- Pattern-based table discovery in Open Data CSV files☆25Updated 2 years ago
- A python tool using XGboost and sentence-transformers to perform schema matching task on tables.☆32Updated 2 months ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated last year
- DuckDB is an in-process SQL OLAP Database Management System☆43Updated last week
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆22Updated 2 years ago
- A python API for exposing and processing RDF data from sparql endpoints for data mining and machine learning models in convenient formats…☆16Updated 2 years ago
- Benchmarking the Chase☆9Updated 7 years ago
- This repository provides data and scripts to use Sherlock, a DL-based model for semantic data type detection: https://sherlock.media.mit.…☆161Updated 9 months ago
- Pollock is a benchmark for data loading on character-delimited files.☆17Updated last month
- An integration of KùzuDB and RDFlib.☆13Updated 5 months ago
- Reference implementations for the LDBC Social Network Benchmark's Business Intelligence (BI) workload☆42Updated last month
- Annotating Columns with Pre-trained Language Models☆33Updated 2 years ago
- ☆24Updated 3 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆76Updated last week
- ☆19Updated 3 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆23Updated 2 years ago