messense / cjieba-py
Python cffi binding to CppJieba
☆15 Updated 5 years ago
Alternatives and similar repositories for cjieba-py
Users who are interested in cjieba-py are comparing it to the libraries listed below.
- Python 3 version of Time-NLP: Chinese time expression recognition ☆91 Updated 5 years ago
- Self-implemented Pinyin2Chinese demo using algorithms including a trie and an HMM: a simple demo of pinyin segmentation and pinyin-to-Chinese conversion based on a hidden Markov model and a trie. ☆86 Updated 7 years ago
- Python implementation of the Aho-Corasick (AC) automaton, with optimizations. Mainly fixes inaccurate query results. ☆76 Updated 4 years ago
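- A simple trie implemented in Python that supports adding, looking up, and deleting keywords, for keyword matching, stop-word removal, and similar tasks on Chinese text. ☆65 Updated 5 years ago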
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also … ☆68 Updated 7 years ago
- Python bindings for cppjieba; jieba_fast is recommended for a balance of result consistency and speed ☆22 Updated 6 years ago
- Word segmentation implemented with CRF++ in Python ☆37 Updated 7 years ago
- Self-implemented SpellCorrection: query correction based on pinyin similarity and edit distance. ☆84 Updated 3 years ago
- Quickly run many NLP tasks: a TensorFlow framework for classification, sequence labeling, matching, generation, and other NLP tasks (Chinese NLP, with distributed support) ☆31 Updated 4 years ago
- 开天-新词, a Chinese new word discovery tool ☆22 Updated 5 years ago
- New word discovery based on word frequency, cohesion coefficient, and left/right adjacency information entropy ☆122 Updated 5 years ago
- Medical pre-trained language model ☆18 Updated 4 years ago
- New word discovery algorithm (NewWordDetection) ☆63 Updated 8 years ago
- 2018 ATEC Ant Financial NLP intelligent customer service competition, 16th/2632 ☆111 Updated 6 years ago
- SMP2017 Chinese human-machine dialogue evaluation data ☆108 Updated 8 years ago
- gensim-fast2vec adaptation for flexible use of large-scale external word vectors (with OOV lookup support) ☆23 Updated 6 years ago
- Pure Python Aho-Corasick library. ☆220 Updated 2 years ago
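- A general-purpose sequence labeling library based on TensorFlow & PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) for Chinese word segmentation (Tokenizer / segmentation), part-of-speech tagging… ☆86 Updated 2 years ago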
- A complete example of BERT from training to deployment ☆23 Updated 5 years ago
- Dependency parsing, NLP, natural language processing ☆85 Updated 4 years ago
- NLP toolkit based on the minimum entropy principle ☆139 Updated 3 years ago
- Use BERT to train a classification model and deploy the model with TensorFlow Serving ☆50 Updated 4 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on massive Chinese data ☆95 Updated 5 years ago
- Chinese NLP corpus datasets ☆20 Updated 6 years ago
- Export BERT model for serving ☆141 Updated 6 years ago
- WordForm: an interface for stroke decomposition, radical lookup, and pinyin conversion of Chinese words ☆65 Updated 7 years ago
- Code for a Chinese error detection module, using n-gram and Bi-LSTM ☆135 Updated 6 years ago
- BiLSTM+CNN+CRF NER, using PyTorch ☆16 Updated 6 years ago
- New word discovery algorithm (NewWordDetection) ☆93 Updated 4 years ago
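- Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, 汽… ☆35 Updated 6 years ago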