今日头条中文新闻(文本)分类数据集
☆404May 19, 2021Updated 4 years ago
Alternatives and similar repositories for toutiao-text-classfication-dataset
Users that are interested in toutiao-text-classfication-dataset are comparing it to the libraries listed below
Sorting:
- 今日头条中文新闻文本(多层)分类数据集☆402May 6, 2021Updated 4 years ago
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆4,580Nov 21, 2023Updated 2 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,869Feb 6, 2026Updated last month
- 搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。☆6,493Jan 29, 2019Updated 7 years ago
- 100+ Chinese Word Vectors 上百种预训练中文词向量☆12,188Oct 30, 2023Updated 2 years ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,183Jul 15, 2025Updated 8 months ago
- CNN-RNN中文文本分类,基于TensorFlow☆4,295Mar 31, 2024Updated last year
- 搜索所有中文NLP数据集,附常用英文NLP数据集☆4,425Nov 21, 2022Updated 3 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,984Nov 21, 2022Updated 3 years ago
- 中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。☆5,709Sep 23, 2020Updated 5 years ago
- all kinds of text classification models and more with deep learning☆7,950Sep 28, 2023Updated 2 years ago
- 文本匹配的相关模型DSSM,ESIM,ABCNN,BIMPM等,数据集为LCQMC官方数据☆471May 8, 2022Updated 3 years ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard☆1,787Feb 18, 2023Updated 3 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,238Feb 6, 2026Updated last month
- A curated list of resources for Chinese NLP 中文自然语言处理相关资料☆7,926Jul 27, 2023Updated 2 years ago
- Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services☆4,900Feb 24, 2021Updated 5 years ago
- 中文文本分类,使用搜狗文本分类语料库☆124Jul 31, 2016Updated 9 years ago
- 自然语言处理实验(sougou数据集),TF-IDF,文本分类、聚类、词向量、情感识别、关系抽取等☆1,731Jul 18, 2022Updated 3 years ago
- 今日头条中文新闻(文本)分类数据集☆69May 14, 2018Updated 7 years ago
- SentiBridge: A Knowledge Base for Entity-Sentiment Representation☆644Sep 20, 2018Updated 7 years ago
- A very simple BiLSTM-CRF model for Chinese Named Entity Recognition 中文命名实体识别 (TensorFlow)☆2,336Apr 18, 2022Updated 3 years ago
- 使用预训练语言模型BERT做中文NER☆975Feb 26, 2020Updated 6 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction☆2,426Jan 22, 2022Updated 4 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- Collections of Chinese NLP corpus☆918Dec 28, 2020Updated 5 years ago
- An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。☆1,386May 31, 2022Updated 3 years ago
- 中文文本语义相似度(Chinese Semantic Text Similarity)语料库建设☆482Mar 7, 2018Updated 8 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,104May 9, 2024Updated last year
- 使用Bert,ERNIE,进行中文文本分类☆4,405Jun 28, 2024Updated last year
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,648Jul 15, 2025Updated 8 months ago
- 一行代码使用BERT生成句向量,BERT做文本分类、文本相似度计算☆1,669Oct 14, 2019Updated 6 years ago
- CCKS 2018 开放领域的中文问答任务 1st 解决方案☆109May 26, 2019Updated 6 years ago
- 中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(…☆1,806Jun 17, 2024Updated last year
- Text Content Grapher based on keyinfo extraction by NLP method。输入一篇文档,将文档进行关键信息提取,进行结构化,并最终组织成图谱组织形式,形成对文章语义信息的图谱化展示。☆1,456Oct 20, 2021Updated 4 years ago
- ☆12Jun 24, 2019Updated 6 years ago
- 复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!☆2,797Updated this week
- bert中文分类实践☆741Dec 11, 2018Updated 7 years ago
- A concept and obvious expression pattern collection of Chinese compound event extraction which then be evolved into ComplexEventGraph,本项目…☆1,217Dec 15, 2018Updated 7 years ago
- Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取☆2,265Feb 1, 2024Updated 2 years ago