ares5221 / Common-NLP-DatasetsLinks
☆16Updated 5 years ago
Alternatives and similar repositories for Common-NLP-Datasets
Users that are interested in Common-NLP-Datasets are comparing it to the libraries listed below
Sorting:
- ☆102Updated 4 years ago
- 用bert4keras加载CDial-GPT☆38Updated 4 years ago
- 中文版unilm预训练模型☆83Updated 4 years ago
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆132Updated 2 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆93Updated 5 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆137Updated 5 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Updated 2 years ago
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆78Updated 5 years ago
- 基于百度webqa与dureader数据集训练的Albert Large QA模型☆76Updated 5 years ago
- 中文文本纠错模型,keras实现☆75Updated 4 years ago
- Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。☆158Updated 4 years ago
- 2020语言与智能技术竞赛-关系抽取-第三名方案☆28Updated 4 years ago
- 用BERT在百度WebQA中文问答数据集上做阅读问答☆65Updated 5 years ago
- 无监督文本生成的一些方法☆49Updated 4 years ago
- 时间抽取、解析、标准化工具☆55Updated 2 years ago
- ☆51Updated 3 years ago
- GLM (General Language Model)☆24Updated 3 years ago
- 中文生成式预训练模型☆99Updated 4 years ago
- Code for chinese error detection module, using n-gram and bi-lstm☆135Updated 6 years ago
- 李傲龍的博客☆82Updated last year
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆184Updated 5 years ago
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆76Updated 2 years ago
- ☆90Updated 5 years ago
- 微调预训练语言模型(BERT、Roberta、XLBert等),用于计算两个文本之间的相似度(通过句子对分类任务转换),适用于中文文本☆90Updated 5 years ago
- 中文 预训练 ELECTRA 模型: 基于对抗学习 pretrain Chinese Model☆141Updated 5 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆59Updated 5 years ago
- This repo contains a PyTorch implementation of a pretrained ERNIE model for text classification.☆59Updated 2 years ago
- Python下shuffle几百G文件☆33Updated 3 years ago
- transformers implement (architecture, task example, serving and more)☆96Updated 3 years ago
- 基于 TensorFlow & PaddlePaddle 的通用序列标注算法库(目前包含 BiLSTM+CRF, Stacked-BiLSTM+CRF 和 IDCNN+CRF,更多算法正在持续添加中)实现中文分词(Tokenizer / segmentation)、词性标注…☆85Updated 2 years ago