xizhicode / ahocorasick-pythonLinks
AC自动机python的实现,并进行了优化。 主要修复了 查询不准确的问题。
☆74Updated 4 years ago
Alternatives and similar repositories for ahocorasick-python
Users that are interested in ahocorasick-python are comparing it to the libraries listed below
Sorting:
- CCKS2019评测任务五-公众公司公告信息抽取,第3名☆122Updated 5 years ago
- Time-NLP的Python3版本 中文时间表达识别☆91Updated 5 years ago
- 微调预训练语言模型(BERT、Roberta、XLBert等),用于计算两个文本之间的相似度(通过句子对分类任务转换),适用于中文文本☆91Updated 5 years ago
- transformers implement (architecture, task example, serving and more)☆96Updated 3 years ago
- Tookit-Sihui, a tool of some common algorithm, AI文本混合科学计算器(calculator-sihui), 句子词频-逆文本频率(TF-IDF),搜索BM25, 前缀树搜索关键词(trietree), 模板匹配-递归函数(fu…☆25Updated 4 years ago
- 使用python实现了一个简单的trie树结构,可 增加/查找/删除关键词,用于中文文本的关键词匹配、停用词删除等。☆65Updated 5 years ago
- 转换 https://github.com/brightmart/albert_zh 到google格式☆62Updated 4 years ago
- NLP的数据增强Demo☆48Updated 5 years ago
- 基于BERT的无监督分词和句法分析☆110Updated 5 years ago
- 将百度ernie的paddlepaddle模型转成tensorflow模型☆178Updated 5 years ago
- ☆57Updated 3 years ago
- ☆92Updated 5 years ago
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆86Updated 7 years ago
- 在bert4keras下加载CPM_LM模型☆51Updated 4 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆93Updated 5 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆147Updated 6 years ago
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆132Updated 2 years ago
- Mining synonyms from unstructured and semi-structured data☆250Updated 9 months ago
- WordForm,针对中文词语的笔画拆解,偏旁查询,拼音转换接口☆65Updated 7 years ago
- 中文文本纠错模型,keras实现☆75Updated 4 years ago
- 基于rasa_框架实现指自然语言相关功能:实体识别、文本分类、代消解功能、关系抽取等☆17Updated 2 years ago
- ☆90Updated 5 years ago
- Code for chinese error detection module, using n-gram and bi-lstm☆135Updated 6 years ago
- 首届中文NL2SQL挑战赛复赛方案,评估数据集acc:0.85 复赛线上成绩: 0.833 Top15☆68Updated 4 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Updated 5 years ago
- 中文 预训练 ELECTRA 模型: 基于对抗学习 pretrain Chinese Model☆141Updated 5 years ago
- Word similarity computation based on Tongyici Cilin☆121Updated 8 years ago
- 中文版unilm预训练模型☆83Updated 4 years ago
- 基于 Tensorflow,仿 Scikit-Learn 设计的深度学习自然语言处理框架。支持 40 余种模型类,涵盖语言模型、文本分类、NER、MRC、知识蒸馏等各个领域☆117Updated 2 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆65Updated 5 years ago