malteos / awesome-document-similarity
A curated list of resources on document similarity measures (papers, tutorials, code, ...)
☆246Updated 2 years ago
Alternatives and similar repositories for awesome-document-similarity:
Users that are interested in awesome-document-similarity are comparing it to the libraries listed below
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆387Updated last year
- ☆345Updated 3 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆262Updated 5 months ago
- A Python library for calculating a large variety of metrics from text☆337Updated 4 months ago
- Entity Disambiguation as text extraction (ACL 2022)☆182Updated 3 years ago
- Implementation of the ClausIE information extraction system for python+spacy☆222Updated 2 years ago
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆430Updated last year
- A curated list of awesome data annotation tools☆209Updated 2 years ago
- [ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction☆306Updated 2 years ago
- Software that makes labeling PDFs easy.☆413Updated 11 months ago
- The official tool for transforming doccano format into common dataset formats.☆106Updated 2 years ago
- Autoregressive Entity Retrieval☆786Updated last year
- REL: Radboud Entity Linker☆308Updated last year
- Coreference Resolution☆76Updated 4 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆172Updated 2 years ago
- Deep Keyphrase Extraction using BERT☆258Updated 3 years ago
- Research framework for low resource text classification that allows the user to experiment with classification models and active learning…☆102Updated 3 years ago
- SummVis is an interactive visualization tool for text summarization.☆252Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆202Updated 2 years ago
- A Dataset of German Legal Documents for Named Entity Recognition☆167Updated 2 years ago
- ☆208Updated 4 years ago
- Active Learning for Text Classification in Python☆613Updated 2 weeks ago
- Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.☆85Updated 4 years ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆341Updated 3 months ago
- Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122☆133Updated 9 months ago
- multimodal document analysis☆164Updated 10 months ago
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆95Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 11 months ago
- BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision☆291Updated 3 years ago