malteos / awesome-document-similarity
A curated list of resources on document similarity measures (papers, tutorials, code, ...)
☆236Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-document-similarity
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆254Updated 2 weeks ago
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆428Updated last year
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆377Updated last year
- REL: Radboud Entity Linker☆304Updated 7 months ago
- Coreference Resolution☆73Updated 3 years ago
- Implementation of the ClausIE information extraction system for python+spacy☆220Updated 2 years ago
- The official tool for transforming doccano format into common dataset formats.☆105Updated last year
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆217Updated 4 months ago
- Few-shot Named Entity Recognition☆122Updated 2 years ago
- A curated list of awesome data annotation tools☆194Updated 2 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 2 years ago
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- A python package for text preprocessing task in natural language processing.☆63Updated 2 years ago
- ☆344Updated 3 years ago
- A Dataset of German Legal Documents for Named Entity Recognition☆160Updated 2 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆161Updated 2 weeks ago
- ☆203Updated 3 years ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆432Updated last year
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆192Updated last year
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆154Updated last year
- Entity Disambiguation as text extraction (ACL 2022)☆177Updated 2 years ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆336Updated last year
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆342Updated last year
- Recent trends of Entity Linking, Disambiguation, and Representation.☆344Updated 3 years ago
- Data and additional information regarding the paper: Contract Discovery. Dataset and a Few-Shot Semantic Retrieval Challenge with Competi…☆29Updated 4 years ago
- [ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction☆301Updated last year
- A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.☆495Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- SummVis is an interactive visualization tool for text summarization.☆251Updated 2 years ago
- Deep Keyphrase Extraction using BERT☆256Updated 2 years ago