xinyandai / string-embed
string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].
☆61Updated last year
Alternatives and similar repositories for string-embed:
Users that are interested in string-embed are comparing it to the libraries listed below
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- Learned string similarity for entity names using optimal transport.☆35Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Updated last year
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆57Updated 4 years ago
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆47Updated 4 years ago
- TransformerDB☆19Updated 3 years ago
- HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface.☆37Updated 2 years ago
- Framework for weakly supervised deep sequence taggers, focused on named entity recognition☆79Updated 2 years ago
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus…☆81Updated 6 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 3 years ago
- State of the art Semantic Sentence Embeddings☆99Updated 2 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis☆14Updated 5 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 11 months ago
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Updated 3 years ago
- Flexible classic and NeurAl Retrieval Toolkit☆216Updated last month
- A classification model☆21Updated 2 years ago
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding☆50Updated 4 years ago
- source code of bison☆26Updated 4 years ago
- Fusion for TREC run files with popular fusion techniques☆21Updated 2 years ago
- ☆34Updated last year
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"☆38Updated 3 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆21Updated 3 years ago
- IR-BERT at TREC 2020: Leveraging BERT for Semantic Search in Background Linking☆14Updated 3 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆42Updated 5 years ago
- Submission archive for the MS MARCO document ranking leaderboard☆29Updated last year
- Converter from UD-trees to BART representation☆36Updated last year
- Knowledge graph based information retrieval☆13Updated 6 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆19Updated 3 years ago
- "Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation☆19Updated 6 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago