megagonlabs / rotomLinks
Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond"
☆23Updated 3 years ago
Alternatives and similar repositories for rotom
Users that are interested in rotom are comparing it to the libraries listed below
Sorting:
- Resources for PVLDB 2023 submission☆24Updated last year
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Updated 3 years ago
- Annotating Columns with Pre-trained Language Models☆33Updated 3 years ago
- ☆18Updated last year
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆45Updated 3 years ago
- The source code of the Sudowoodo paper in ICDE 2023☆17Updated 2 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆33Updated 2 years ago
- Entity resolution using zero labeled examples☆30Updated last year
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆293Updated last year
- Foundation Models for Data Tasks☆109Updated 2 years ago
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework☆11Updated 5 months ago
- Characterization of relational table embeddings (VLDB 2024).☆31Updated last year
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark.☆13Updated 2 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia☆32Updated 3 years ago
- ☆26Updated 7 years ago
- Continuous Benchmark of Filtering methods for Entity Resolution☆11Updated 2 months ago
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆92Updated 4 months ago
- Code and data for Sato https://arxiv.org/abs/1911.06311.☆115Updated last year
- CoDEx: A set of knowledge graph Completion Datasets Extracted from Wikidata and Wikipedia☆168Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems☆31Updated last year
- Code and data for "TURL: Table Understanding through Representation Learning"☆127Updated 3 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆18Updated 3 years ago
- 🕸️ A graph-augmented dense statute retriever. (EACL 2023)☆23Updated 2 years ago
- ArcheType uses LLMs to automatically assign custom labels to your tabular data☆17Updated 4 months ago
- The source code for self-supervised Taxonomy Completion framework TaxoEnrich, published in WWW 2022.☆21Updated 3 years ago
- ReFinED is an efficient and accurate entity linking (EL) system.☆219Updated 9 months ago
- ACL 2023 (Findings) - BertNet: Harvesting Knowledge Graphs from Pretrained Language Models☆107Updated last year
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆21Updated last year
- ☆30Updated 2 years ago
- ☆79Updated last year