malteos / awesome-document-similarity
A curated list of resources on document similarity measures (papers, tutorials, code, ...)
☆240Updated 2 years ago
Alternatives and similar repositories for awesome-document-similarity:
Users that are interested in awesome-document-similarity are comparing it to the libraries listed below
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆381Updated last year
- A curated list of awesome data annotation tools☆200Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆255Updated 2 months ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆338Updated last week
- ☆344Updated 3 years ago
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆428Updated last year
- A python package for text preprocessing task in natural language processing.☆63Updated 2 years ago
- Entity Disambiguation as text extraction (ACL 2022)☆178Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight☆107Updated last year
- ☆203Updated 3 years ago
- Autoregressive Entity Retrieval☆776Updated last year
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- [ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction☆305Updated 2 years ago
- The official tool for transforming doccano format into common dataset formats.☆106Updated last year
- architectures and pre-trained models for long document classification.☆154Updated 4 years ago
- A Dataset of German Legal Documents for Named Entity Recognition☆163Updated 2 years ago
- SummVis is an interactive visualization tool for text summarization.☆251Updated 2 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆217Updated 6 months ago
- Exploring the simple sentence similarity measurements using word embeddings☆100Updated 4 months ago
- Coreference Resolution☆74Updated 3 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆604Updated 2 years ago
- A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.☆494Updated 2 years ago
- Enriching BERT with Knowledge Graph Embedding for Document Classification (PyTorch)☆158Updated 5 years ago
- Recent trends of Entity Linking, Disambiguation, and Representation.☆344Updated 3 years ago
- LexRank algorithm for text summarization☆229Updated 9 months ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆158Updated 4 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆175Updated 5 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆166Updated 2 months ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆72Updated last year