megagonlabs / rotom
Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond"
☆23 Updated 3 years ago
Alternatives and similar repositories for rotom
Users who are interested in rotom are comparing it to the repositories listed below
- Resources for PVLDB 2023 submission ☆24 Updated last year
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021 ☆21 Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space … ☆33 Updated 2 years ago
- ☆18 Updated last year
- Foundation Models for Data Tasks ☆110 Updated 2 years ago
- The source code of the Sudowoodo paper in ICDE 2023 ☆17 Updated 2 years ago
- Entity resolution using zero labeled examples ☆30 Updated last year
- To reproduce experiments of the paper "Entity Matching with Transformer Architectures" ☆27 Updated 5 years ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models" ☆294 Updated last year
- ☆26 Updated 7 years ago
- Annotating Columns with Pre-trained Language Models ☆34 Updated 3 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia ☆32 Updated 3 years ago
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark. ☆13 Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆18 Updated 3 years ago
- Code and data for Sato https://arxiv.org/abs/1911.06311. ☆115 Updated last year
- CoDEx: A set of knowledge graph Completion Datasets Extracted from Wikidata and Wikipedia ☆168 Updated last year
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io). ☆45 Updated 3 years ago
- ☆80 Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen… ☆64 Updated 2 years ago
- ☆61 Updated 3 years ago
- Code and data for "TURL: Table Understanding through Representation Learning" ☆129 Updated 3 years ago
- ACL 2023 (Findings) - BertNet: Harvesting Knowledge Graphs from Pretrained Language Models ☆107 Updated last year
- This repository contains code and data for reproducing the experiments of three papers that focus on two subtasks of table annotation: co… ☆12 Updated 7 months ago
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations ☆91 Updated 3 years ago
- ☆32 Updated 4 years ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable. ☆154 Updated 8 months ago
- ArcheType uses LLMs to automatically assign custom labels to your tabular data ☆17 Updated 5 months ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding ☆58 Updated 4 years ago
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework ☆11 Updated 5 months ago
- A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner. ☆134 Updated last year