steven-s / text-shingles
k-shingling for text to help compare similarity
☆19Updated 5 years ago
Alternatives and similar repositories for text-shingles:
Users that are interested in text-shingles are comparing it to the libraries listed below
- creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.☆52Updated 7 years ago
- SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model☆36Updated 7 years ago
- A neural text process python lib for context-based feature extraction on Seq-Tagging data.☆10Updated 6 years ago
- ☆43Updated 8 years ago
- ☆20Updated 6 years ago
- Experiment with document similarity via Matt Kusner's MWD paper☆24Updated 8 years ago
- Key-phrase extraction for research publications using graph-representation of texts and centrality measures☆19Updated 9 years ago
- Facilitate the learning, practicing, and designing of neural text matching models with a user-friendly and interactive interface.☆38Updated 2 years ago
- 中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室☆46Updated 9 years ago
- Creates knowledge graph from information processed by "Entity Extraction and Linking" module, and "Emotion Recognition from Text" module☆36Updated 7 years ago
- UNSUPPORTED & OUTDATED: Derive named entities from Wikipedia☆47Updated 5 years ago
- Subword based Pairwise Word Interaction Model for Paraphrase Identification☆22Updated 6 years ago
- Python toolkit for ranking experiments on sentence/summary data☆24Updated 2 years ago
- Implementation of Attention-Based Neural Matching Model Proposed in CIKM16 for Answer Sentence Selection☆42Updated 7 years ago
- Neural Reranking for Named Entity Recognition, accepted as regular paper at RANLP 2017☆23Updated 7 years ago
- Identify Events from text using Natural Language Processing Modules☆33Updated 8 years ago
- Very Simple Question Answer System using Chinese Wikipedia Data☆23Updated 9 months ago
- CRFs based Chinese word segmentor☆19Updated 10 years ago
- A program to correct non-word spelling error in sentences using ngram MAP Language Models, Noisy Channel Model, Error Confusion Matrix an…☆53Updated 4 years ago
- QA - Answer Selection (Rank candidate answers for a given question)☆36Updated 6 years ago
- Event extraction pipeline.☆34Updated 7 years ago
- Word and text similarity measures☆54Updated 2 years ago
- Chinese Word Segmention Base on the Deep Learning and LSTM Neural Network☆21Updated 8 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆21Updated 4 years ago
- Named Entity Disambiguation for Noisy Text☆66Updated 7 years ago
- Attentional Neural Network that translates text to phones.☆11Updated 7 years ago
- SegPhrase working on Chinese and Arabic☆35Updated 8 years ago
- LDA topic modeling with word2vec using gaussian topic distributions for infinite vocabulary☆51Updated 9 years ago
- The code for COPACRR Neural IR model.☆37Updated 7 years ago
- Attempt at using LSTMs to predict semantic relatedness of sentences (a la Tai et al. in Improved Semantic Representations From Tree-Struc…☆22Updated 9 years ago