xinyandai / string-embed
String embedding for fast edit distance computation. Code for the paper "Convolutional Embedding for Edit Distance" (SIGIR 2020).
☆61 · Updated 2 years ago
Alternatives and similar repositories for string-embed
Users who are interested in string-embed are comparing it to the repositories listed below.
- This repository contains source code to binarize any real-value word embeddings into binary vectors. ☆47 · Updated 4 years ago
- Learned string similarity for entity names using optimal transport. ☆35 · Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space … ☆31 · Updated 2 years ago
- Code for pre-training CharacterBERT models (as well as BERT models). ☆34 · Updated 3 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia ☆31 · Updated 2 years ago
- Locality-sensitive hashing (LSHASH) for Python3 ☆69 · Updated last week
- Submission archive for the MS MARCO document ranking leaderboard ☆30 · Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… ☆47 · Updated 7 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆16 · Updated 3 years ago
- TransformerDB ☆19 · Updated 4 years ago
- Converter from UD-trees to BART representation ☆36 · Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆45 · Updated last year
- My most frequently used learning-to-rank algorithms ported to Rust for efficiency. Try it: "pip install fastrank". ☆52 · Updated 2 months ago
- Source code of bison ☆26 · Updated 4 years ago
- Fusion for TREC run files with popular fusion techniques ☆21 · Updated 2 years ago
- Implementation of SiameseXML (ICML 2021) ☆40 · Updated 2 years ago
- A large scale feature extraction tool for text-based machine learning ☆32 · Updated 2 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆53 · Updated this week
- Standalone Neural Ranking Model (SNRM) ☆76 · Updated 6 years ago
- ☆34 · Updated 4 years ago
- Knowledge graph based information retrieval ☆13 · Updated 6 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding ☆57 · Updated 4 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis. ☆107 · Updated 3 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging ☆34 · Updated 5 years ago
- Submission archive for the MS MARCO passage ranking leaderboard ☆13 · Updated 2 years ago
- Anserini notebooks ☆69 · Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021 ☆19 · Updated 3 years ago
- Supplementary code for "Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs". ☆47 · Updated 5 years ago
- TREC-COVID results: a mirror of data on the TREC website in a more convenient format. ☆14 · Updated 4 years ago
- "Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation ☆19 · Updated 6 years ago