MichaelAquilina / hashedindexLinks
Python package providing an Inverted Index implementation using dictionaries
☆35Updated 4 years ago
Alternatives and similar repositories for hashedindex
Users that are interested in hashedindex are comparing it to the libraries listed below
Sorting:
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆14Updated 7 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated 2 weeks ago
- A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection☆60Updated 8 years ago
- Text pattern search using marisa-trie☆18Updated 4 months ago
- minimal example for sentence embedding by Smooth Inverse Frequency weighting scheme☆35Updated 7 years ago
- Collection of functions and scripts for text retrieval in Python: Document collection preprocessing, Feature Selection, Indexing, Query p…☆44Updated 12 years ago
- creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.☆52Updated 7 years ago
- Facebook faiss相关的python接口☆15Updated 5 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Contains the main implementation of programs for the paper: Reproducing and learning new algebraic operations on word embeddings using ge…☆12Updated 8 years ago
- Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT☆23Updated 2 years ago
- PyTorch implementation of context2vec from Melamud et al., CoNLL 2016☆19Updated 6 years ago
- A better working example of SIFRank and SIFRank+ models for keyword extraction. Easy to setup using docker-compose.☆11Updated 7 months ago
- maximum entropy based part-of-speech tagger for NLTK☆45Updated 8 years ago
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- k-shingling for text to help compare similarity