xinyandai / string-embed
String embedding for fast edit distance computation; code for [Convolutional Embedding for Edit Distance (SIGIR 20)].
☆60 Updated last year
Alternatives and similar repositories for string-embed:
Users who are interested in string-embed are comparing it to the libraries listed below:
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… ☆46 Updated 6 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space … ☆31 Updated last year
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆16 Updated 3 years ago
- Learned string similarity for entity names using optimal transport. ☆34 Updated 4 years ago
- An open-source TAR framework for experiments and applications ☆16 Updated 10 months ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks ☆22 Updated 2 weeks ago
- Code for pre-training CharacterBERT models (as well as BERT models). ☆34 Updated 3 years ago
- TransformerDB ☆19 Updated 3 years ago
- TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format. ☆14 Updated 4 years ago
- Fusion for TREC run files with popular fusion techniques ☆22 Updated 2 years ago
- This repository contains source code to binarize any real-value word embeddings into binary vectors. ☆47 Updated 4 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021 ☆18 Updated 3 years ago
- HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface. ☆37 Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated last week
- Data Programming by Demonstration (DPBD) for Document Classification ☆35 Updated 3 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis ☆14 Updated 4 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia ☆30 Updated 2 years ago
- Salient Open Information Extraction ☆20 Updated 6 years ago
- Knowledge graph based information retrieval ☆13 Updated 6 years ago
- Implementation of SiameseXML (ICML 2021) ☆40 Updated 2 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis. ☆105 Updated 3 years ago
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus… ☆81 Updated 6 years ago
- A toolkit for end-to-end neural ad hoc retrieval ☆96 Updated 4 months ago
- Converter from UD-trees to BART representation ☆36 Updated 10 months ago
- Codebase for the Text-based NP Enrichment (TNE) paper ☆19 Updated 10 months ago
- Framework for weakly supervised deep sequence taggers, focused on named entity recognition ☆79 Updated last year
- Submission archive for the MS MARCO document ranking leaderboard ☆28 Updated last year
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings ☆58 Updated 2 years ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆32 Updated 7 months ago