kavgan / phrase-at-scaleLinks

Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English

☆127

Alternatives and similar repositories for phrase-at-scale

Users that are interested in phrase-at-scale are comparing it to the libraries listed below

Sorting:

napsternxg / TwitterNER
Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html
☆139Updated 2 years ago
sb1992 / NETL-Automatic-Topic-Labelling-
Generating labels for topics automatically using neural embeddings
☆185Updated 5 months ago
snorkel-team / snorkel-extraction
A previous version of Snorkel focused on information extraction
☆35Updated 5 years ago
nateraw / Lda2vec-Tensorflow
Tensorflow 1.5 implementation of Chris Moody's Lda2vec, adapted from @meereeum
☆109Updated 6 years ago
lgalke / vec4ir
Word Embeddings for Information Retrieval
☆225Updated last year
gaetangate / text-summarizer
Python Framework for Extractive Text Summarization
☆113Updated 3 years ago
mpuig / spacy-lookup
Named Entity Recognition based on dictionaries
☆242Updated 6 years ago
clips / clinspell
Clinical spelling correction with word and character n-gram embeddings.
☆74Updated 3 years ago
crownpku / text2vec
Easily generate document/paragraph/sentence vectors and calculate similarity.
☆136Updated 3 years ago
roamanalytics / mittens
A fast implementation of GloVe, with optional retrofitting
☆244Updated 2 years ago
CogComp / talen
A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities
☆115Updated 3 weeks ago
utkuozbulak / unsupervised-learning-document-clustering
Document clustering and topic modelling with Python
☆86Updated 7 years ago
prateekjoshi565 / ULMFiT_Text_Classification
Transfer Learning for NLP Tasks
☆55Updated 6 years ago
kavgan / opinosis-summarization
This repo contains code and dataset for the Opinosis Summarization Framework
☆51Updated 5 years ago
datquocnguyen / jPTDP
Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)
☆156Updated 6 years ago
aoldoni / tetre
TETRE: a Toolkit for Exploring Text for Relation Extraction
☆75Updated 8 years ago
mwydmuch / extremeText
Library for fast text representation and extreme classification.
☆151Updated 4 years ago
dperezrada / keywords2vec
☆123Updated 2 years ago
amansrivastava17 / embedding-as-service
One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
☆206Updated 2 years ago
sfu-discourse-lab / SOCC
SFU Opinion and Comments Corpus
☆91Updated last month
atefm / pDMM
Python implemetation for Dirichlet Multinomial Mixture (DMM) model
☆47Updated 3 years ago
akanimax / natural-language-summary-generation-from-structured-data
Implementation of the paper -> https://arxiv.org/abs/1709.00155. For converting information present in the form of structured data into n…
☆188Updated 6 years ago
tca19 / dict2vec
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
☆114Updated 4 years ago
vzhong / embeddings
Fast, DB Backed pretrained word embeddings for natural language processing.
☆222Updated 4 months ago
zelandiya / keyword-extraction-datasets
Different datasets for developing and testing keyword extraction algorithms
☆109Updated 10 years ago
tuzhucheng / sentence-similarity
PyTorch implementations of various deep learning models for paraphrase detection, semantic similarity, and textual entailment
☆107Updated 7 years ago
mhjabreel / CharCnn_Keras
The implementation of text classification using character level convoultion neural networks using Keras
☆150Updated 2 years ago
stephenhky / PyShortTextCategorization
Various Algorithms for Short Text Mining
☆472Updated this week
ArtificiAI / Multilingual-Latent-Dirichlet-Allocation-LDA
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
☆84Updated last year
hltcoe / EventMiner
Event extraction pipeline.
☆34Updated 7 years ago