JZCS2018 / SMATLinks
Model and datasets for schema matching
☆11Updated 3 years ago
Alternatives and similar repositories for SMAT
Users that are interested in SMAT are comparing it to the libraries listed below
Sorting:
- A python tool using XGboost and sentence-transformers to perform schema matching task on tables.☆33Updated 3 months ago
- The source code of the Sudowoodo paper in ICDE 2023☆15Updated 2 years ago
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark.☆13Updated last year
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆25Updated 2 years ago
- Characterization of relational table embeddings (VLDB 2024).☆30Updated 11 months ago
- Retrieval-Augmented Generation-based Relation Extraction☆39Updated this week
- BERTMap: A BERT-Based Ontology Alignment System☆65Updated last year
- ☆26Updated last year
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆44Updated 3 years ago
- This is the repository for TimelineQA, a benchmark for querying lifelogs.☆23Updated last year
- LLM-Augmented Entity Linking☆12Updated 10 months ago
- Foundation Models for Data Tasks☆106Updated 2 years ago
- ☆10Updated 4 months ago
- ☆38Updated 3 months ago
- ☆16Updated this week
- ☆13Updated 2 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science☆26Updated 7 months ago
- scrapper for various science databases☆11Updated last year
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆88Updated last week
- A general-purpose library for cross-document NLP modelling and analysis☆11Updated last year
- SciRepEval benchmark training and evaluation scripts☆74Updated last year
- ☆91Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆89Updated 6 months ago
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆19Updated 2 years ago
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆67Updated last month
- ☆24Updated 2 years ago
- Framework for Cost-Effective Language Model Choice☆13Updated last year
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆19Updated last year
- General Fine-Tuning: A little language for Deep Nets (ACL-2022 Tutorial)☆16Updated last year
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆22Updated last year