Collections of Chinese NLP corpora
☆918 · Dec 28, 2020 · Updated 5 years ago
Alternatives and similar repositories for Chinese-NLP-Corpus
Users that are interested in Chinese-NLP-Corpus are comparing it to the libraries listed below.
- Code for the ACL 2020 paper FLAT: Chinese NER Using Flat-Lattice Transformer ☆1,004 · May 10, 2022 · Updated 3 years ago
- Chinese NLP datasets, material for everyday experiments. Additions and pull requests are welcome. ☆4,581 · Nov 21, 2023 · Updated 2 years ago
- Chinese NER using Lattice LSTM. Code for ACL 2018 paper. ☆1,835 · Apr 25, 2019 · Updated 6 years ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard ☆1,786 · Feb 18, 2023 · Updated 3 years ago
- Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series) ☆10,179 · Jul 15, 2025 · Updated 8 months ago
- Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span) ☆2,237 · Mar 11, 2023 · Updated 3 years ago
- Search all Chinese NLP datasets, with commonly used English NLP datasets included ☆4,422 · Nov 21, 2022 · Updated 3 years ago
- DeepIE: Deep Learning for Information Extraction ☆1,943 · Dec 9, 2022 · Updated 3 years ago
- Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard ☆4,238 · Feb 6, 2026 · Updated last month
- CLUENER2020: Chinese Fine-Grained Named Entity Recognition ☆1,523 · Nov 21, 2022 · Updated 3 years ago
- Collects, organizes, and publishes Chinese NLP corpora and datasets, working with like-minded contributors to advance Chinese NLP. ☆6,507 · Jan 29, 2019 · Updated 7 years ago
- Large Scale Chinese Corpus for NLP ☆9,872 · Feb 6, 2026 · Updated last month
- CCKS2019 Chinese named entity recognition task. Recognizes six types of named entities in medical text: diseases and diagnoses, anatomical sites, imaging examinations, laboratory tests, surgeries, and drugs. Currently implements a baseline built on jieba and an Aho-Corasick (AC) automaton, and a sequence labeling model built on BiLSTM and CRF. The BERT code is mainly derived from https://gith… ☆354 · Oct 9, 2022 · Updated 3 years ago
- 100+ Chinese Word Vectors: over a hundred kinds of pre-trained Chinese word vectors ☆12,188 · Oct 30, 2023 · Updated 2 years ago
- Named Entity Recognition for Chinese social media (Weibo). From EMNLP 2015 paper. ☆558 · Jun 9, 2020 · Updated 5 years ago
- A collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and models specialized for similarity ☆816 · Jul 8, 2020 · Updated 5 years ago
- A compilation of public Chinese medical NLP resources: terminologies/corpora/word vectors/pre-trained models/knowledge graphs/named entity recognition/QA/information extraction/models/papers/etc. ☆2,560 · Jan 17, 2024 · Updated 2 years ago
- Datasets and SOTA results for every field of Chinese NLP ☆1,812 · Apr 7, 2022 · Updated 3 years ago
- A Lite BERT for Self-Supervised Learning of Language Representations, large-scale Chinese pre-trained ALBERT models ☆3,983 · Nov 21, 2022 · Updated 3 years ago
- CCKS Baidu entity linking, first place ☆842 · Dec 19, 2023 · Updated 2 years ago
- Chinese named entity recognition (with concrete implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF) ☆2,275 · Jun 21, 2022 · Updated 3 years ago
- Reject complicated operations for incorporating lexicon for Chinese NER. ☆437 · Jan 22, 2022 · Updated 4 years ago
- A neural network model for Chinese named entity recognition ☆1,827 · Aug 9, 2018 · Updated 7 years ago
- An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes. ☆1,385 · May 31, 2022 · Updated 3 years ago
- One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda ☆1,880 · Mar 18, 2025 · Updated last year
- Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition` ☆679 · Jun 12, 2023 · Updated 2 years ago
- fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation. ☆3,147 · Jun 5, 2023 · Updated 2 years ago
- Pre-Trained Chinese XLNet ☆1,648 · Jul 15, 2025 · Updated 8 months ago
- [Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more. ☆2,597 · Mar 7, 2026 · Updated 3 weeks ago
- Data Augmentation for NLP ☆294 · Dec 10, 2020 · Updated 5 years ago
- TensorFlow solution for the NER task using a BiLSTM-CRF model with Google BERT fine-tuning and private server services ☆4,903 · Feb 24, 2021 · Updated 5 years ago
- Pre-trained Chinese ELECTRA ☆1,440 · Jul 15, 2025 · Updated 8 months ago
- A Chinese EHR BERT Pretrained Model. ☆269 · Jul 14, 2021 · Updated 4 years ago
- A review of top solutions from all NLP competitions; focused solely on NLP competitions and continuously updated! ☆2,798 · Mar 15, 2026 · Updated last week
- Open Language Pre-trained Model Zoo ☆1,005 · Nov 18, 2021 · Updated 4 years ago
- Using BiLSTM-CRF, BERT, and other methods for sequence tagging tasks ☆415 · Jun 12, 2023 · Updated 2 years ago
- Code for Chinese knowledge base question answering; fourth-place solution in the CCKS2019 CKBQA evaluation ☆477 · Apr 5, 2021 · Updated 4 years ago
- A curated list of resources for Chinese NLP ☆7,928 · Jul 27, 2023 · Updated 2 years ago
- Corpora related to NLP and knowledge graphs. Organized by task; PRs welcome. ☆731 · Jan 15, 2021 · Updated 5 years ago