kavgan / phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used in languages other than English.
☆127 Updated 6 years ago
Alternatives and similar repositories for phrase-at-scale
Users who are interested in phrase-at-scale are comparing it to the libraries listed below.
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html ☆139 Updated 2 years ago
- Create a knowledge base using domain specific documents and the mammoth python library ☆134 Updated 6 years ago
- Generating labels for topics automatically using neural embeddings ☆185 Updated 4 months ago
- Named Entity Recognition based on dictionaries ☆242 Updated 6 years ago
- Implementation of the paper -> https://arxiv.org/abs/1709.00155. For converting information present in the form of structured data into n… ☆188 Updated 6 years ago
- Clinical spelling correction with word and character n-gram embeddings. ☆74 Updated 3 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities ☆115 Updated 3 years ago
- Document clustering and topic modelling with Python ☆85 Updated 7 years ago
- PyTorch implementations of various deep learning models for paraphrase detection, semantic similarity, and textual entailment ☆107 Updated 7 years ago
- Word Embeddings for Information Retrieval ☆225 Updated last year
- Easily generate document/paragraph/sentence vectors and calculate similarity. ☆136 Updated 3 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018) ☆157 Updated 6 years ago
- Python Framework for Extractive Text Summarization ☆113 Updated 3 years ago
- Tensorflow 1.5 implementation of Chris Moody's Lda2vec, adapted from @meereeum ☆108 Updated 6 years ago
- Transfer Learning for NLP Tasks ☆55 Updated 6 years ago
- A previous version of Snorkel focused on information extraction ☆35 Updated 5 years ago
- DAANet: Dual Ask-Answer Network for Machine Reading Comprehension ☆144 Updated 6 years ago
- Rank-based Unsupervised Keyword Extraction via Metavertex Aggregation ☆99 Updated 7 months ago
- A simple python implementation of the Maximal Marginal Relevance (MMR) baseline system for text summarization. ☆66 Updated 8 years ago
- This is an implementation of Hearst patterns, for finding hyponyms, written in Python. ☆87 Updated 2 years ago
- One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques ☆207 Updated 2 years ago
- Dataset for the Emerging & Novel Entity NER task (WNUT '17) ☆111 Updated 3 years ago
- An extension of word2vec to learn phrase embeddings ☆75 Updated 6 years ago
- State-of-the-art Supervised Sentence Simplification System from ACL 2014 ☆46 Updated 6 years ago
- Code for keyphrase classification systems submitted to the SemEval 2017 shared task ScienceIE. ☆36 Updated 7 years ago
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines ☆110 Updated 7 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe… ☆114 Updated 5 years ago
- Relationship and Entity Extraction Evaluation Dataset ☆80 Updated 7 years ago
- ☆123 Updated 2 years ago
- WWW 2018: CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information ☆102 Updated 2 years ago