4OH4 / doc-similarity
Ranking documents using semantic similarity in Python
☆35Updated 4 years ago
Alternatives and similar repositories for doc-similarity:
Users that are interested in doc-similarity are comparing it to the libraries listed below
- ☆35Updated 3 years ago
- Named entity relevant project☆30Updated 4 years ago
- The official tool for transforming doccano format into common dataset formats.☆107Updated last year
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆53Updated last year
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆159Updated 4 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe…☆114Updated 5 years ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆85Updated 2 years ago
- Applying BERT to named entity recognition in English and Russian.☆162Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆71Updated last year
- Train a model to find the names of products in text☆37Updated 5 years ago
- Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.☆84Updated 4 years ago
- shabeelkandi / Handling-Out-of-Vocabulary-Words-in-Natural-Language-Processing-using-Language-Modelling☆69Updated 5 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- architectures and pre-trained models for long document classification.☆154Updated 4 years ago
- Using BERT For Classifying Documents with Long Texts, check my latest post: https://armandolivares.tech/☆41Updated 5 years ago
- Steam review texting embedding analysis☆141Updated 2 years ago
- ☆15Updated last year
- Named entity recognizer based on ELMo or BERT as feature extractor and CRF as final classifier☆80Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 11 months ago
- The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics.☆35Updated 10 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆118Updated 11 months ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆63Updated last year
- Building a text classifier with extremely small datasets☆44Updated 5 years ago
- Exploring the simple sentence similarity measurements using word embeddings☆101Updated 7 months ago
- Key information extraction from text and graph visualization☆91Updated 4 years ago
- Package that returns a company embedding given a company name☆45Updated 4 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated last year