lionsoul2014 / jcseg
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, and summary extraction based on the TextRank algorithm. Jcseg has a built-in HTTP server and search modules for Lucene, Solr, Elasticsearch, and OpenSearch.
☆914 Updated last year
Related projects
Alternatives and complementary repositories for jcseg
- A base package that wraps the tools most commonly used in NLP projects ☆1,496 Updated 7 months ago
- No longer maintained. Please contact the original author. ☆655 Updated 6 years ago
- ☆638 Updated 4 months ago
- HanLP Chinese word segmentation plugin for Lucene; supports Lucene-based systems, including Solr ☆297 Updated 4 years ago
- An Efficient Lexical Analyzer for Chinese ☆328 Updated 6 years ago
- Distributed Chinese word segmentation component for Java: the word segmenter ☆1,818 Updated 3 years ago
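- mmseg4j for Lucene or Solr analyzer ☆398 Updated 9 months ago
- Java open-source project cws_evaluation: evaluation and comparison of the segmentation quality of Chinese word segmenters ☆949 Updated 7 years ago
- HanLP-based word segmentation plugin for Elasticsearch ☆156 Updated 3 years ago
- HanLP Analyzer for Elasticsearch ☆832 Updated 4 months ago
- A production-grade, high-performance, modular, extensible Chinese NLP toolkit (Chinese word segmentation, averaged perceptron, fastText, pinyin, new word discovery, segmentation error correction, BM25, person name recognition, named entities, custom dictionaries) ☆678 Updated 10 months ago
- mmseg4j core: MMSEG for Java Chinese analyzer ☆157 Updated 5 years ago
- The Mmseg Analysis plugin integrates the Lucene mmseg4j-analyzer (code.google.com/p/mmseg4j/) into Elasticsearch, support customized dictionar… ☆359 Updated 3 years ago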
- A Java implementation of word2vec ☆694 Updated 3 years ago
- jieba analysis plugin for Elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0, 5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1 ☆526 Updated 10 months ago
- A Java implementation of keyword extraction with the TextRank algorithm ☆201 Updated 9 years ago
- 🚲 STConvert is an analyzer that converts Chinese characters between Traditional and Simplified forms. ☆359 Updated 6 months ago
- A smart Chinese pinyin word segmentation plugin that runs on Elasticsearch; supports mixed search over full pinyin, initial letters, and Chinese characters ☆155 Updated 11 months ago
- Jieba word segmentation (Java version) ☆2,587 Updated 4 months ago
- The dynamic synonym plugin adds a synonym token filter that reloads the synonym file (local file or remote file) at given intervals (defau… ☆372 Updated last year
- Java port of Darts (Double ARray Trie System) ☆268 Updated 6 years ago
- IKAnalyzer for Solr5 ☆143 Updated 7 years ago
- The plugin includes the `jieba` analyzer, `jieba` tokenizer, and `jieba` token filter, and has two modes you can choose from. One is `index` w… ☆314 Updated 3 years ago
- HanLP Analysis for Elasticsearch ☆89 Updated 5 years ago
- Automatically builds a Chinese lexicon: http://www.matrix67.com/blog/archives/5044 ☆650 Updated 11 months ago
- Temporal semantics recognition in Chinese sentences, i.e., identifying the times mentioned in an utterance by analyzing Chinese sentences. ☆629 Updated 11 months ago
- Chinese word segmentation tool: a Java implementation of THULAC. ☆85 Updated 3 years ago
- 🛵 This Pinyin Analysis plugin is used to convert between Chinese characters and Pinyin. ☆2,961 Updated 6 months ago
- A simple implementation of the simhash algorithm in Java. ☆154 Updated 4 years ago