megagonlabs / rotomLinks
Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond"
☆23Updated 3 years ago
Alternatives and similar repositories for rotom
Users that are interested in rotom are comparing it to the libraries listed below
Sorting:
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Updated 4 years ago
- Resources for PVLDB 2023 submission☆24Updated last year
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆32Updated 2 years ago
- Annotating Columns with Pre-trained Language Models☆34Updated 3 years ago
- The source code of the Sudowoodo paper in ICDE 2023☆17Updated 2 years ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆45Updated 4 years ago
- Foundation Models for Data Tasks☆110Updated 2 years ago
- Code and data for Sato https://arxiv.org/abs/1911.06311.☆116Updated last year
- ☆26Updated 7 years ago
- ☆18Updated last year
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark.☆13Updated 2 years ago
- Code and data for "TURL: Table Understanding through Representation Learning"☆131Updated last month
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Updated 4 years ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆302Updated last year
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia☆32Updated 3 years ago
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆60Updated 2 years ago
- Entity resolution using zero labeled examples☆32Updated last year
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework☆12Updated 7 months ago
- ☆32Updated 4 years ago
- CoDEx: A set of knowledge graph Completion Datasets Extracted from Wikidata and Wikipedia☆169Updated last year
- Continuous Benchmark of Filtering methods for Entity Resolution☆11Updated 5 months ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆35Updated 2 years ago
- ☆80Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Updated 8 months ago
- [SIGIR 2021] Retrieving Complex Tables with Multi-Granular Graph Representation Learning.☆48Updated 3 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆36Updated 2 years ago
- Pytorch implementation of a BiLSTM model for the Wikification project.☆19Updated 5 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.☆155Updated 10 months ago
- A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.☆138Updated last year