kavgan / phrase-at-scaleLinks
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
☆128Updated 6 years ago
Alternatives and similar repositories for phrase-at-scale
Users that are interested in phrase-at-scale are comparing it to the libraries listed below
Sorting:
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆139Updated 3 years ago
- Generating labels for topics automatically using neural embeddings☆185Updated 5 months ago
- Clinical spelling correction with word and character n-gram embeddings.☆74Updated 3 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Python Framework for Extractive Text Summarization☆113Updated 3 years ago
- Easily generate document/paragraph/sentence vectors and calculate similarity.☆137Updated 3 years ago
- Transfer Learning for NLP Tasks☆55Updated 6 years ago
- A previous version of Snorkel focused on information extraction☆35Updated 5 years ago
- PyTorch implementations of various deep learning models for paraphrase detection, semantic similarity, and textual entailment☆107Updated 7 years ago
- Word Embeddings for Information Retrieval☆225Updated last year
- A short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)☆153Updated 5 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated last month
- One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques☆209Updated 2 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe…☆114Updated 5 years ago
- A system for learning word weights, optimised for sentence-level vector similarity☆42Updated 9 years ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆197Updated 5 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)☆156Updated 6 years ago
- creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.☆52Updated 8 years ago
- Create a knowledge base using domain specific documents and the mammoth python library☆135Updated 6 years ago
- Document clustering and topic modelling with Python☆87Updated 7 years ago
- The implementation of text classification using character level convoultion neural networks using Keras☆150Updated 2 years ago
- Implementation of Hobbs' algorithm for coreference resolution in python☆44Updated 5 years ago
- Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019☆30Updated 6 years ago
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆110Updated 7 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆221Updated last year
- Dict2vec is a framework to learn word embeddings using lexical dictionaries.☆115Updated 4 years ago
- This is our team's solution report, which achieves top 10% (305/3307) in this competition.☆59Updated 8 years ago
- Code for unsupervised aspect extraction, using Keras and its Backends☆91Updated 2 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- The implementation of the paper "A Multi-task Approach for Named Entity Recognition on Social Media Data," which won the WNUT-2017 Shared…☆68Updated 7 years ago