quincyliang / nlp-public-datasetView external linksLinks
Chinese, English NER, English-Chinese machine translation dataset. 中英文实体识别数据集,中英文机器翻译数据集, 中文分词数据集
☆371Feb 3, 2021Updated 5 years ago
Alternatives and similar repositories for nlp-public-dataset
Users that are interested in nlp-public-dataset are comparing it to the libraries listed below
Sorting:
- 搜索所有中文NLP数据集,附常用英文NLP数据集☆4,418Nov 21, 2022Updated 3 years ago
- a neural machine translation system from english (chinese) to chinese (english) based on 30m parallel data.☆69Mar 31, 2021Updated 4 years ago
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆4,575Nov 21, 2023Updated 2 years ago
- code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer☆1,005May 10, 2022Updated 3 years ago
- NER(命名实体识别)中文语料,一站式获取☆130Sep 10, 2019Updated 6 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,857Feb 6, 2026Updated last week
- 中英机器文本翻译☆168Jul 2, 2019Updated 6 years ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,173Jul 15, 2025Updated 7 months ago
- 在tensor2tensor中使用自己的语料实现中英文翻译☆23Mar 18, 2019Updated 6 years ago
- 100+ Chinese Word Vectors 上百种预训练中文词向量☆12,181Oct 30, 2023Updated 2 years ago
- Collections of Chinese NLP corpus☆917Dec 28, 2020Updated 5 years ago
- Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)☆2,231Mar 11, 2023Updated 2 years ago
- TestB榜第10的方案,bleu32.1☆63Nov 28, 2019Updated 6 years ago
- DeepIE: Deep Learning for Information Extraction☆1,943Dec 9, 2022Updated 3 years ago
- Chinese NER using Lattice LSTM. Code for ACL 2018 paper.☆1,834Apr 25, 2019Updated 6 years ago
- 中文生成式预训练模型☆99Aug 28, 2020Updated 5 years ago
- 中文命名实体识别(包括多种模型:HMM,CRF,BiLSTM,BiLSTM+CRF的具体实现)☆2,272Jun 21, 2022Updated 3 years ago
- 英中机器文本翻译☆63Jan 2, 2019Updated 7 years ago
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,879Mar 18, 2025Updated 10 months ago
- Codes for "TENER: Adapting Transformer Encoder for Named Entity Recognition"☆378Jul 6, 2020Updated 5 years ago
- ccks baidu entity link 实体链接 第一名☆843Dec 19, 2023Updated 2 years ago
- 基于双向RNN,Attention机制的编解码神经机器翻译模型☆62Jan 15, 2018Updated 8 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,439Jul 15, 2025Updated 7 months ago
- Chinese Language Generation Evaluation 中文生成任务基准测评☆249Dec 9, 2020Updated 5 years ago
- A curated list of resources for Chinese NLP 中文自然语言处理相关资料☆7,926Jul 27, 2023Updated 2 years ago
- Reject complicated operations for incorporating lexicon for Chinese NER.☆437Jan 22, 2022Updated 4 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,230Feb 6, 2026Updated last week
- 搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。☆6,468Jan 29, 2019Updated 7 years ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,273May 19, 2022Updated 3 years ago
- 中文命名实体识别,实体抽取,tensorflow,pytorch,BiLSTM+CRF☆1,460Mar 15, 2020Updated 5 years ago
- Using CRF++ for NER☆20Feb 28, 2019Updated 6 years ago
- 序列化标注工具,基于PyTorch实现BLSTM-CNN-CRF模型,CoNLL 2003 English NER测试集F1值为91.10%(word and char feature)。☆364Jul 24, 2018Updated 7 years ago
- Datasets, SOTA results of every fields of Chinese NLP☆1,815Apr 7, 2022Updated 3 years ago
- Chinese NER using Lattice LSTM. Reproduction for ACL 2018 paper.☆131Apr 17, 2020Updated 5 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,986Nov 21, 2022Updated 3 years ago
- A very simple BiLSTM-CRF model for Chinese Named Entity Recognition 中文命名实体识别 (TensorFlow)☆2,339Apr 18, 2022Updated 3 years ago
- Kashgari 框架的中文文档☆22Sep 11, 2020Updated 5 years ago
- Code for NeurIPS 2019 - Glyce: Glyph-vectors for Chinese Character Representations☆425Oct 3, 2023Updated 2 years ago