jannson / simhash-py
Simhash and near-duplicate detection
☆17Updated 10 years ago
Related projects: ⓘ
- A Slot-filling based Dialog Manager for Task-oriented Bot☆11Updated 7 years ago
- This code is for Convolutional Latent Semantic Model, which is similay with DSSM(Deep Semantic Similarity Model).☆25Updated 9 years ago
- A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries☆12Updated 3 years ago
- ☆10Updated this week
- 新词发现☆68Updated 10 years ago
- A HMM-like linear-chain CRF, used Tensorflow API.☆37Updated 6 years ago
- machine reading comprehension with deep learning☆20Updated 6 years ago
- ☆13Updated 5 years ago
- Clone of "A Good Part-of-Speech Tagger in about 200 Lines of Python" by Matthew Honnibal☆49Updated 8 years ago
- ☆64Updated this week
- Use the famous language model, xlnet, to do sequence tagging/ sequence labelling/ named entity recognition(NER) / noun extraction;☆18Updated 4 years ago
- CMU RavenClaw对话管理☆12Updated 6 years ago
- I read papers, and here are my highlights.☆16Updated 4 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- named entity recognition combined with rule from entity dict☆12Updated 4 years ago
- Chinese new word discovery☆42Updated 3 weeks ago
- Dilation Gate CNN For Machine Reading Comprehension☆17Updated last year
- Sub-Character Representation Learning☆25Updated 6 years ago
- Tools used to do Chinese Word Segmentation☆23Updated 10 years ago
- ☆15Updated 7 years ago
- A C++ version GBDT tool. Very fast at single machine. No time to make a distribution version.☆22Updated 8 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆36Updated 5 years ago
- Implementation of semantic question matching with deep learning approaches mentioned in the blog of Quora.☆15Updated 7 years ago
- 该代码是基于字典树对word的识别结果进行矫正,使用于中英文混合的字典。字典树(trietree):常用应用于大量字符串的保存、统计、查找等操作。☆13Updated 7 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆57Updated 4 years ago
- Deep structured semantic model☆32Updated 8 years ago
- kenlm语言模型,并提供python的rest服务☆29Updated 6 years ago
- Chinese Word Segmentation using CRF++☆25Updated 10 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆11Updated 5 years ago
- pytorch版bert权重 转tf☆21Updated 4 years ago