sea-boat / TextAnalyzer
A text analyzer which is based on machine learning,statistics and dictionaries that can analyze text. So far, it supports hot word extracting, text classification, part of speech tagging, named entity recognition, chinese word segment, extracting address, synonym, text clustering, word2vec model, edit distance, chinese word segment, sentence si…
☆204Updated 6 years ago
Alternatives and similar repositories for TextAnalyzer:
Users that are interested in TextAnalyzer are comparing it to the libraries listed below
- The missing SVM-based text classification module implementing HanLP's interface☆47Updated 7 years ago
- First publish for PKUSUMSUM☆245Updated 7 years ago
- ltp4j: Language Technology Platform For Java☆162Updated 3 years ago
- 中文短文句相似读☆134Updated 6 years ago
- Tree-split 搬新家..给各位带来的不便深表歉意☆56Updated 8 years ago
- 用户评论标签挖掘☆71Updated 7 years ago
- A curated list of resources for NLP (Natural Language Processing) for Chinese 中文自然语言处理相关资料☆161Updated 7 years ago
- 语义相似度计算各种算法实现汇总☆45Updated 7 years ago
- 主谓宾提取器的Java实现(对斯坦福的代码失去兴趣,不再维护)☆139Updated 9 years ago
- An Efficient Chinese Text Classifier☆206Updated 6 years ago
- Train Wikidata with word2vec for word embedding tasks☆122Updated 6 years ago
- 相似度计算软件包☆190Updated last year
- mltk web edition☆41Updated 8 years ago
- 基于深度学习的自然语言处理库☆152Updated 6 years ago
- Question and Answering Model with TensorFlow☆32Updated 2 years ago
- 基于标题分类的主题句提取方法可描述为: 给定一篇新闻报道, 计算标题与新闻主题词集的相似度, 判断标题是否具有提示性。对于提示性标题,抽取新闻报道中与其最相似的句子作为主题句; 否则, 综合利用多种特征计算新闻报道中句子的重要性, 将得分最高的句子作为主题句。☆40Updated 8 years ago
- Opendial对话语料库☆50Updated 6 years ago
- THU Chinese Keyphrase Extraction Toolkit☆124Updated 6 years ago
- Implementing Facebook's FastText with java☆158Updated 4 years ago
- An interpreter module for AIML (Artificial Intelligence Markup Language), support Chinese, support Python3☆69Updated 4 years ago
- 对中文分词jieba (python版)的注解☆89Updated 6 years ago
- 一个随时恭候询问的耐心小二~☆89Updated 7 years ago
- 语义理解/口语理解,项目包含有词法分析:中文分词、词性标注、命名实体识别;口语理解:领域分类、槽填充、意图识别。☆177Updated 6 years ago
- 基于PageRank的TextRank方法, 可以应用于中文关键词、短语、摘要提取程序,代码使用Scala编写。☆126Updated 4 years ago
- word2vec/glove/swivel binary file on chinese corpus☆398Updated 8 years ago
- Details of paper cw2vec☆82Updated 6 years ago
- A Java implementation of doc2vec in ICML'14☆25Updated 9 years ago
- 一个基于 Rasa 的中文天气情况问询机器人(chatbot), 带 Web UI 界面☆237Updated 5 years ago
- 深度学习聊天机器人资源集合 Awesome chatbot resource list☆288Updated 3 years ago
- 啊哈自然语言处理包,提供包括分词、依存句法分析、语义角色标注、自动摘要、语义相似度计算、LDA 主题预测、词云等服务。☆302Updated 5 months ago