howl-anderson / chinese-wikipedia-corpus-creator
Corpus creator for Chinese Wikipedia
☆41 Updated 3 years ago
Alternatives and similar repositories for chinese-wikipedia-corpus-creator:
Users who are interested in chinese-wikipedia-corpus-creator are comparing it to the repositories listed below:
- SMP2017 Chinese human-computer dialogue evaluation data ☆107 Updated 7 years ago
- ALBERT Large QA model trained on the Baidu WebQA and DuReader datasets ☆75 Updated 4 years ago
- Chinese NLP corpus datasets ☆20 Updated 6 years ago
- ChineseAntiword: a Chinese antonym lookup interface based on a dictionary crawled from online resources ☆59 Updated 6 years ago
- ☆92 Updated 4 months ago
- TensorFlow framework for quickly running NLP tasks such as classification, sequence labeling, matching, and generation (Chinese NLP, with distributed support) ☆30 Updated 4 years ago
- A corpus of Chinese abbreviations, including negative full forms. ☆194 Updated 3 years ago
- Collection: task-oriented chatbot based on Rasa-NLU and Rasa-Core ☆48 Updated 6 years ago
- Chinese word segmentation software benchmark (Chinese tokenizer benchmark) ☆23 Updated 6 years ago
- New word discovery algorithm (NewWordDetection) ☆92 Updated 4 years ago
- New word discovery algorithm (NewWordDetection) ☆62 Updated 7 years ago
- ☆42 Updated 6 years ago
- Dependency parsing, NLP, natural language processing ☆85 Updated 3 years ago
- Python toolkit for the Chinese Language Understanding Evaluation (CLUE) benchmark ☆129 Updated last year
- Chinese generative pre-trained model ☆98 Updated 4 years ago
- Self-implemented SpellCorrection: query correction based on pinyin similarity and edit distance. ☆82 Updated 2 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on massive Chinese data ☆91 Updated 5 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation ☆53 Updated 5 years ago
- New word discovery based on word frequency, cohesion coefficient, and left/right adjacent information entropy ☆123 Updated 5 years ago
- ☆75 Updated 2 years ago
- Source code and corpora for the paper "Iterated Dilated Convolutions for Chinese Word Segmentation" ☆135 Updated 3 years ago
- Train Wikidata with word2vec for word embedding tasks ☆122 Updated 6 years ago
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also … ☆66 Updated 6 years ago
- Chinese Open Information Extraction (Tree-based Triple Relation Extraction Module) ☆118 Updated 7 years ago
- A Public Corpus for Machine Learning ☆44 Updated 6 years ago
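- Word segmentation implemented with Python and CRF++ ☆37 Updated 6 years ago
- High-performance small model evaluation: Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task ☆58 Updated 4 years ago
- Memory for Knowledge Graph, using Neo4j. Knowledge graph storage and querying. ☆45 Updated 3 months ago
- Performance evaluation of major Chinese word segmenters ☆157 Updated 6 years ago
- Related work on knowledge-base-driven open-domain question answering systems ☆69 Updated 6 years ago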