yuikns / icwb2-data
This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also included is the script used to score the results submitted by the bakeoff participants and the simple segmenter used to generate the baseline and topline data.
☆65 Updated 6 years ago
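The description mentions a scoring script for segmentation output. As a rough illustration of how word segmentation is commonly scored (this is not the bundled script, and the function names `word_spans` and `score` are hypothetical), a minimal Python sketch can compare the character spans of predicted words against the gold-standard words and report precision, recall, and F1:

```python
def word_spans(words):
    """Map a list of words to the set of (start, end) character spans they cover."""
    spans = set()
    start = 0
    for word in words:
        end = start + len(word)
        spans.add((start, end))
        start = end
    return spans


def score(gold_words, pred_words):
    """Return (precision, recall, f1) for one segmented sentence.

    A predicted word counts as correct only if both of its boundaries
    match a word in the gold segmentation.
    """
    gold = word_spans(gold_words)
    pred = word_spans(pred_words)
    correct = len(gold & pred)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Illustrative sentence (an assumption, not taken from the data set).
gold = ["共同", "创造", "美好", "的", "新", "世纪"]
pred = ["共同", "创造", "美", "好", "的", "新世纪"]
print(score(gold, pred))  # (0.5, 0.5, 0.5)
```

Comparing spans rather than word strings keeps repeated words from being matched at the wrong position in the sentence.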
Related projects:
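- Chinese word segmentation implemented with Python and CRF++ ☆37 Updated 6 years ago
- ALBERT+BiLSTM+CRF implemented on top of lightweight ALBERT ☆87 Updated last year
- Performance evaluation of major Chinese word segmenters ☆151 Updated 5 years ago
- New word discovery based on word frequency, cohesion coefficient, and left/right adjacency entropy ☆122 Updated 4 years ago
- Annotated Chinese corpus from People's Daily, January to April 1998 ☆28 Updated 5 years ago
- Unsupervised word segmentation and syntactic parsing based on BERT ☆109 Updated 4 years ago
- SMP2017 Chinese human-computer dialogue evaluation data ☆106 Updated 6 years ago
- A general-purpose sequence labeling library based on TensorFlow & PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) for Chinese word segmentation (Tokenizer / segmentation), part-of-speech tagging… ☆85 Updated last year
- Dependency parsing, NLP, natural language processing ☆86 Updated 2 years ago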
- E-Commerce Sentiment Dict ☆125 Updated 6 years ago
- A curated list of Chinese corpus resources for NLP (Natural Language Processing) ☆73 Updated 4 years ago
- Text classification, similar-sentence detection, and part-of-speech tagging with BERT ☆87 Updated 5 years ago
- ☆75 Updated last year
- Word similarity computation based on Tongyici Cilin ☆117 Updated 7 years ago
- New word discovery algorithm (NewWordDetection) ☆63 Updated 7 years ago
- Transformers implementation (architecture, task examples, serving, and more) ☆97 Updated 2 years ago
- BERT fine-tuning on Chinese corpora (Fine-tune Chinese for BERT) ☆80 Updated 5 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on large-scale Chinese data ☆89 Updated 4 years ago
- A Chinese word segmentation model based on BERT, F1 score 97% ☆90 Updated 5 years ago
- A self-implemented query spelling correction (SpellCorrection) based on pinyin similarity and edit distance ☆79 Updated 2 years ago
- New word discovery algorithm (NewWordDetection) ☆93 Updated 3 years ago
- Converts https://github.com/brightmart/albert_zh to Google format ☆62 Updated 3 years ago
- Relation Extraction: Chinese relation extraction ☆72 Updated 5 years ago
- ☆72 Updated this week
- NLP NER datasets video/music/book bio ☆83 Updated 3 years ago
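- ChineseTextualInference: a Chinese textual inference project, including translation and construction of an 880,000-item Chinese textual entailment dataset and a deep-learning-based textual entailment classification model… ☆163 Updated 5 years ago
- Chinese pre-trained UniLM model ☆82 Updated 3 years ago
- NLP toolkit based on the minimum entropy principle ☆137 Updated 2 years ago
- SmoothNLP domain vocabulary examples, based on Fudan's public news corpus ☆49 Updated 4 years ago
- A simple review opinion extraction module based on LTP ☆116 Updated 5 years ago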