yangjianxin1 / CPM
Easy-to-use CPM for Chinese text generation(基于CPM的中文文本生成)
☆530Updated last year
Alternatives and similar repositories for CPM:
Users that are interested in CPM are comparing it to the libraries listed below
- 基于GPT2的中文摘要生成模型☆409Updated last year
- Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料☆936Updated 2 years ago
- transformer xl在中文文本生成上的尝试(可写小说、古诗)(transformer xl for text generation of chinese)☆709Updated 2 years ago
- dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人,基于问答型对话、任务型对话、聊天型对话等模型实现,支持网络检索问答,领域知识…☆330Updated 9 months ago
- ☆246Updated 2 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,812Updated last year
- Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。☆1,104Updated 2 years ago
- 中文生成式预训练模型☆559Updated 2 years ago
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆808Updated 4 years ago
- Open Language Pre-trained Model Zoo☆992Updated 3 years ago
- 提供一款中文版生成式摘要服务☆336Updated this week
- PERT: Pre-training BERT with Permuted Language Model☆356Updated last year
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集☆597Updated last year
- 机器阅读理解 冠军/亚军代码及中文预训练MRC模型☆734Updated 2 years ago
- ☆411Updated 11 months ago
- 自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要,文本相似度,科学计算器,中文数字阿拉伯数字(罗马数字)转换,中文繁简转换,拼音转换。tookit(tool) of N…☆659Updated last year
- ☆438Updated 2 years ago
- SimBERT升级版(SimBERTv2)!☆441Updated 2 years ago
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆950Updated 5 months ago
- pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。☆1,292Updated 2 years ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆520Updated last year
- 中文文本生成(NLG)之文本摘要(text summarization)工具包, 语料数据(corpus data), 抽取式摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word sign…☆408Updated 8 months ago
- Poetry-related datasets developed by THUAIPoet (Jiuge) group.☆219Updated 4 years ago
- MiniRBT (中文小型预训练模型系列)☆265Updated last year
- a bert for retrieval and generation☆852Updated 3 years ago
- GPT2 training script for Chinese in Tensorflow 2.0☆153Updated 3 years ago
- GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)☆3,007Updated last year
- pytorch中文语言模型预训练☆389Updated 4 years ago
- ChineseSemanticKB,chinese semantic knowledge base, 面向中文处理的12类、百万规模的语义常用词典,包括34万抽象语义库、34万反义语义库、43万同义语义库等,可支持句子扩展、转写、事件抽象与泛化等多种应用场景。☆749Updated last year
- Open Chinese Language Pre-trained Model Zoo☆978Updated 4 years ago