lionsoul2014 / jcseg
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, and summary extraction based on the TextRank algorithm. Jcseg has a built-in HTTP server and search modules for Lucene, Solr, Elasticsearch, and OpenSearch.
☆920Updated last year
Alternatives and similar repositories for jcseg
Users that are interested in jcseg are comparing it to the libraries listed below
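- This project is a base package that wraps the tools most commonly used in NLP projects☆1,504Updated last year
- HanLP Chinese word segmentation plugin for Lucene, supporting Lucene-based systems including Solr☆298Updated 4 years ago
- Java distributed Chinese word segmentation component - word segmenter☆1,822Updated 4 years ago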
- ☆640Updated this week
- No longer maintained. Please contact the original author.☆663Updated 7 years ago
- An Efficient Lexical Analyzer for Chinese☆332Updated 7 years ago
- mmseg4j for lucene or solr analyzer☆398Updated last year
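- Java open-source project cws_evaluation: evaluation and comparison of Chinese word segmenter results☆951Updated 7 years ago
- A production-grade, high-performance, modular, extensible Chinese NLP toolkit (Chinese word segmentation, averaged perceptron, fastText, pinyin, new word discovery, segmentation error correction, BM25, person name recognition, named entities, custom dictionaries)☆681Updated last year
- Elasticsearch word segmentation plugin based on HanLP☆157Updated 3 years ago
- Jieba word segmentation (Java version)☆2,633Updated 10 months ago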
- mmseg4j core MMSEG for java chinese analyzer☆156Updated 6 years ago
- jieba analysis plugin for elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0, 5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1☆533Updated last year
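- ansj word segmentation: a true Java implementation of ICT. Both segmentation quality and speed exceed the open-source version of ICT. Chinese word segmentation, person name recognition, part-of-speech tagging, user-defined dictionaries☆6,520Updated last year
- A Java implementation of word2vec☆700Updated 4 years ago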
- HanLP Analyzer for Elasticsearch☆842Updated 10 months ago
- Java porting of Darts (Double ARray Trie System)☆270Updated 6 years ago
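- A configurable web spider with an easy-to-use web console☆994Updated 6 years ago
- Java implementation of keyword extraction with the TextRank algorithm☆203Updated 10 years ago
- paoding-rose provides the most user-friendly integrated framework for Java web applications.☆603Updated 6 years ago
- A smart Chinese pinyin segmentation plugin that runs on Elasticsearch, supporting mixed search across full pinyin, initial letters, and Chinese characters☆156Updated last year
- Temporal semantics recognition in Chinese sentences: analyzes Chinese sentences to identify the times mentioned in them.☆639Updated last year
- QuestionAnsweringSystem is a human-machine question answering system implemented in Java that can automatically analyze questions and give candidate answers.☆1,957Updated 6 years ago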
- ltp4j: Language Technology Platform For Java☆161Updated 4 years ago
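- Chinese Word Segmentation Tool, a Java implementation of THULAC.☆84Updated 4 years ago
- Similarity computation package☆190Updated last year
- Simple sensitive-word processor that supports returning, highlighting, and replacing sensitive words, among other operations☆263Updated 7 years ago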
- The dynamic synonym plugin adds a synonym token filter that reloads the synonym file(local file or remote file) at given intervals (defau…☆378Updated last year
- HanLP Analysis for Elasticsearch☆89Updated 6 years ago
- Tokenizer supporting Lucene 5/6/7/8/9+ versions, LTS☆206Updated last year