xinyandai / string-embed
string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].
☆60Updated last year
Alternatives and similar repositories for string-embed:
Users that are interested in string-embed are comparing it to the libraries listed below
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆47Updated 4 years ago
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus…☆81Updated 6 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆57Updated 4 years ago
- HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface.☆37Updated 2 years ago
- Learned string similarity for entity names using optimal transport.☆35Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Updated last year
- Knowledge graph based information retrieval☆13Updated 6 years ago
- Implementation of SiameseXML (ICML 2021)☆40Updated 2 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆34Updated 4 years ago
- An opensource TAR framework for experiments and applications☆16Updated 11 months ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- Neural Vector Space Models☆49Updated 6 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Updated 5 years ago
- ECLARE: Extreme Classification with Label Graph Correlations☆42Updated 2 years ago
- Word embedding approach based on a dynamic log-linear model☆55Updated 7 years ago
- Fusion for TREC run files with popular fusion techniques☆21Updated 2 years ago
- RankDCG: ranking/ordering evaluation measure☆38Updated 3 years ago
- A large scale feature extraction tool for text-based machine learning☆32Updated 2 years ago
- Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EM…☆38Updated 2 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆42Updated 5 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between…☆32Updated last year
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 3 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.☆43Updated 2 years ago
- TransformerDB☆19Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆168Updated 3 years ago
- PyTorch implementation of AVITM (Autoencoding Variational Inference For Topic Models)☆36Updated 2 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆21Updated 3 years ago
- Getting interpretable dimensions in word embedding spaces.☆14Updated last year