kavgan / phrase-at-scaleLinks
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
☆131Updated 6 years ago
Alternatives and similar repositories for phrase-at-scale
Users that are interested in phrase-at-scale are comparing it to the libraries listed below
Sorting:
- Generating labels for topics automatically using neural embeddings☆185Updated 4 months ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆140Updated 3 years ago
- PyTorch implementations of various deep learning models for paraphrase detection, semantic similarity, and textual entailment☆107Updated 7 years ago
- Named Entity Recognition based on dictionaries☆241Updated 6 years ago
- Clinical spelling correction with word and character n-gram embeddings.☆75Updated 3 years ago
- Word Embeddings for Information Retrieval☆225Updated 2 years ago
- Python Framework for Extractive Text Summarization☆113Updated 4 years ago
- Transfer Learning for NLP Tasks☆55Updated 7 years ago
- One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques☆210Updated 2 years ago
- Easily generate document/paragraph/sentence vectors and calculate similarity.☆138Updated 4 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated 6 months ago
- A previous version of Snorkel focused on information extraction☆36Updated 6 years ago
- Neural Abstractive Text Summarization with Sequence-to-Sequence Models☆158Updated 6 years ago
- This repo contains code and dataset for the Opinosis Summarization Framework☆51Updated 6 years ago
- ☆123Updated 2 years ago
- The implementation of text classification using character level convoultion neural networks using Keras☆149Updated 2 years ago
- Character-based word embeddings model based on RNN for handling real world texts☆174Updated 2 years ago
- Automatic labeling for topic model☆57Updated 10 years ago
- Concatenate word and character embeddings in Keras☆45Updated 4 years ago
- Dataset for the Emerging & Novel Entity NER task (WNUT '17)☆112Updated 3 years ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆198Updated 5 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆222Updated last year
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆110Updated 7 years ago
- Various Algorithms for Short Text Mining☆472Updated last week
- ☆50Updated 4 years ago
- A simple python implementation of the Maximal Marginal Relevance (MMR) baseline system for text summarization.☆67Updated 9 years ago
- This is an implementation of Hearst patterns, for finding hyponyms, written in Python.☆87Updated 3 years ago
- Dict2vec is a framework to learn word embeddings using lexical dictionaries.☆115Updated 5 years ago
- Exploring the simple sentence similarity measurements using word embeddings☆99Updated last year
- Create a knowledge base using domain specific documents and the mammoth python library☆137Updated 6 years ago