explosion / spacy-pkusegLinks
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
☆62Updated 3 weeks ago
Alternatives and similar repositories for spacy-pkuseg
Users that are interested in spacy-pkuseg are comparing it to the libraries listed below
Sorting:
- 各大中文分词性能评测☆157Updated 6 years ago
- A large high-quality corpus of Chinese synonyms 一个大型、高质量的中文同义词语料库。☆56Updated 3 years ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 2 years ago
- 中文标点符号模型,可以给文本添加标点符号。☆142Updated 6 months ago
- A convenient Chinese word segmentation tool 简便中文分词器☆46Updated last month
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆245Updated 2 years ago
- Minimal keyword extraction with BERT☆84Updated 3 years ago
- 时间抽取、解析、标准化工具☆52Updated 2 years ago
- 基于sentence-transformers实现文本转向量的机器人☆46Updated 2 years ago
- 中文纠错☆92Updated 3 years ago
- 基于 g2pW 提升 pypinyin 的准确性☆94Updated 2 years ago
- 中文标注工具,支持NER、文本分类、关系标注、对话标注等。☆76Updated 10 months ago
- Pytorch model for https://github.com/imcaspar/gpt2-ml☆79Updated 3 years ago
- SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准☆144Updated last year
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆64Updated last year
- 渊 - A project for Classical Chinese☆105Updated 3 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆37Updated last month
- ☆125Updated 4 years ago
- MiniRBT (中文小型预训练模型系列)☆282Updated 2 years ago
- 大规模中文语料☆42Updated 5 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆172Updated 6 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)☆214Updated 2 years ago
- rasa_chinese 专门针对中文语言的 rasa 组件扩展包,提供了许多针对中文语言的组件☆150Updated 2 years ago
- ☆34Updated 3 years ago
- 基于Pytorch 1.0 实现的中文断句与标点符号恢复。☆58Updated 6 years ago
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆86Updated 7 years ago
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆52Updated 2 years ago
- PERT: Pre-training BERT with Permuted Language Model☆362Updated 2 years ago
- 基于bert进行中文文本纠错☆235Updated 2 years ago
- ChatGLM-6B fine-tuning.☆135Updated 2 years ago