kohjiaxuan / NLP-Model-for-Corpus-Similarity

A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.

☆9

Alternatives and similar repositories for NLP-Model-for-Corpus-Similarity:

Users that are interested in NLP-Model-for-Corpus-Similarity are comparing it to the libraries listed below

laugustyniak / textlytics
Text processing library for sentiment analysis and related tasks
☆27Updated 6 years ago
nkthiebaut / zeugma
📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…
☆62Updated last year
sobamchan / pytorch-lightning-transformers
Fine-tune transformers with pytorch-lightning
☆44Updated 2 years ago
PacktPublishing / fastText-Quick-Start-Guide
fastText Quick Start Guide, published by Packt
☆49Updated last year
SkBlaz / rakun
Rank-based Unsupervised Keyword Extraction via Metavertex Aggregation
☆99Updated 2 months ago
shlomihod / deep-text-eval
Differnable Readability Measure Regularizer for Neural Network Automatic Text Simplification
☆24Updated last year
dbmdz / deep-eos
General-Purpose Neural Networks for Sentence Boundary Detection
☆73Updated last year
ffancellu / NegNN
Neural Network for Automatic Negation Detection
☆20Updated 8 years ago
whcjimmy / lda2vec
☆15Updated 5 years ago
kavgan / phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in …
☆125Updated 5 years ago
huggingface / neuralcoref-viz
✨ Web interface for NeuralCoref coreference resolution
☆34Updated last year
maobedkova / TopicModelling_PySpark_SparkNLP
Tutorial for Topic Modelling using PySpark and Spark NLP
☆16Updated 4 years ago
mayhewsw / pytorch-truecaser
A simple neural truecaser written in pytorch and allennlp.
☆32Updated 7 months ago
krzysiekfonal / grammaregex
Regex like pattern tree matching but on sentence's tree instead of Strings
☆42Updated 6 years ago
sdimi / average-word2vec
🔤 Calculate average word embeddings (word2vec) from documents for transfer learning
☆54Updated 8 months ago
tca19 / dict2vec
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
☆114Updated 4 years ago
IBM / WordMoversEmbeddings
WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus…
☆81Updated 6 years ago
asafamr / bertwsi
Word Sense Induction with BERT MLM
☆28Updated last year
Oneplus / Tweebank
A collection of English tweets annotated in Universal Dependencies.
☆39Updated 3 years ago
jackee777 / babelnetpy
A python 3 interface for BabelNet https://babelnet.org/
☆31Updated last year
xiaohan2012 / chowmein
Automatic labeling for topic model
☆57Updated 9 years ago
Isminoula / TextNormSeq2Seq
Code and model files for paper: I. Lourentzou et al., Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSM…
☆36Updated 3 years ago
nicharuc / Collocations
N-gram Extraction Approaches (bigrams, trigrams)
☆42Updated 6 years ago
ozanarkancan / char-ner
Multi lingual character based named entity recognizer
☆25Updated 6 years ago
natasha / ipymarkup
NER, syntax markup visualizations
☆136Updated last year
BramVanroy / spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…
☆78Updated 6 months ago
vered1986 / Chirps
A Large Automatically-Constructed Resource of Predicate Paraphrases
☆43Updated 4 years ago
blester125 / iobes
Tool for parsing and converting various span encoding schemes.
☆22Updated last year
TakeLab / spacy-udpipe
spaCy + UDPipe
☆161Updated 2 years ago