houbb / opencc4j
🇨🇳 Open Chinese Convert is an open-source project for conversion between Traditional Chinese and Simplified Chinese. (Traditional/Simplified Chinese conversion for Java)
☆470 Updated last year
Related projects:
- A high-performance pinyin tool for Java. (High-performance Chinese-to-pinyin conversion tool for Java; supports homophones.) ☆234 Updated last year
- A copy of http://sourceforge.net/projects/pinyin4j, deployed to the Maven Central repository. ☆1,235 Updated last year
- Conversion between pinyin and Chinese characters, and between Simplified and Traditional Chinese characters. ☆136 Updated last year
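- The jieba-analysis tool for Java. (A more flexible, elegant, easy-to-use, high-performance Java word segmentation implementation built on the jieba segmentation dictionary; supports part-of-speech tagging.) ☆139 Updated 6 months ago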
- A Chinese toolkit, including Simplified/Traditional Chinese conversion, pinyin conversion, and Chinese word segmentation. ☆179 Updated 9 years ago
- Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, with also keywo… ☆911 Updated last year
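- A production-grade, high-performance, modular, extensible Chinese NLP toolkit. (Chinese word segmentation, averaged perceptron, fastText, pinyin, new word discovery, segmentation error correction, BM25, person name recognition, named entities, custom dictionaries) ☆675 Updated 8 months ago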
- ☆487 Updated this week
- This project is a base package that wraps the tools commonly used in most NLP projects. ☆1,490 Updated 5 months ago
- Jieba word segmentation (Java version) ☆2,550 Updated 2 months ago
- A simple sensitive-word processor that supports operations such as returning, highlighting, and replacing sensitive words. ☆261 Updated 6 years ago
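- Calendars: Gregorian (solar), Chinese lunar (lunisolar, traditional almanac), Buddhist, and Taoist calendars; supports holidays, zodiac constellations, Julian day, heavenly stems and earthly branches (ganzhi), Chinese zodiac, solar terms, festivals, Pengzu's taboos, daily dos and don'ts, auspicious and inauspicious deities, directions of the auspicious deities (Joy/Fortune/Wealth/Yang Noble/Yin Noble), direction of the Fetal God, clash and sha, nayin, lunar mansions, Eight Characters (bazi), Five Elements, Ten Gods, the twelve Jianchu day officers, the twelve deities such as Azure Dragon and Bright Hall, ecliptic (auspicious) days and… ☆686 Updated 6 months ago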
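- A Java implementation of the Chinese lunar calendar, covering about 300 years of Gregorian dates: 1850-02-12 to 2150-12-31; a single Java class under 1,000 lines with no third-party dependencies. ☆104 Updated 2 years ago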
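- 🇨🇳🇬🇧 Chinese and English word spelling corrector. (Detection of commonly mistaken Chinese characters, Chinese spelling detection and correction; English word spell-checking tool.) ☆229 Updated last year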
- A pure-Java HTML parser supporting the W3C XPath 1.0 standard syntax. An HTML parser with XPath based on Jsoup and Antlr4. Maybe it is the best in Java. Just try it. ☆452 Updated 9 months ago
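- xk-time is a tool for time conversion, time calculation, time formatting, time parsing, calendars, cron expressions, and time NLP. It uses Java 8 (JSR-310), is thread-safe and easy to use, offers more than 70 common date formatting templates, supports both the Java 8 time classes and Date, and is lightweight with no third-party dependencies. ☆320 Updated 2 years ago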
- An audio transcoding tool, mainly used to convert WeChat voice messages from AMR to MP3 so they can be played in the HTML5 audio tag. ☆217 Updated 4 years ago
- kaptcha - A kaptcha generation engine. ☆443 Updated 5 years ago
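- A project for converting Simplified and Traditional Chinese characters to pinyin that handles polyphonic characters. A pinyin tokenizer for ElasticSearch and Solr. ☆113 Updated 3 years ago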
- No longer maintained. Please contact the original author. ☆653 Updated 6 years ago
- Java Image I/O reader and writer for the Google WebP image format without system native libs ☆157 Updated 4 years ago
- Plain Java unrar library ☆286 Updated 4 months ago
- Tokenizer supporting Lucene 5/6/7/8/9+ versions, LTS ☆200 Updated 9 months ago
- An Efficient Lexical Analyzer for Chinese ☆326 Updated 6 years ago
- HanLP Analyzer for Elasticsearch ☆825 Updated 2 months ago
- Tencent Cloud API 3.0 SDK for Java ☆519 Updated this week
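- A Java OCR recognition component (based on the Tesseract OCR engine). It automatically handles image cleanup and recognition of CAPTCHA image content in one integrated step. Java Image cleanup, OCR recognition component (based Tesseract OCR e… ☆606 Updated 3 years ago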
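- As elegant as art and as simple as 1, 2, 3: a lightweight yet powerful HTTP client usable on both front end and back end (also supports the WebSocket and STOMP protocols). ☆482 Updated 3 weeks ago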
- 🚲 STConvert is an analyzer that converts Chinese characters between Traditional and Simplified. (Conversion between Simplified and Traditional Chinese.) ☆353 Updated 4 months ago
- Twitter's Snowflake algorithm, implemented in Java. ☆844 Updated 6 years ago