imcaspar/gpt2-ml

imcaspar / gpt2-ml

GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter Chinese pretrained model.

☆1,704

Alternatives and similar repositories for gpt2-ml

Users who are interested in gpt2-ml are comparing it to the libraries listed below.

Morizeyao / GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
☆7,599 · Apr 25, 2024 · Updated 2 years ago
yangjianxin1 / GPT2-chitchat
GPT2 model for Chinese chitchat
☆2,996 · Oct 30, 2023 · Updated 2 years ago
Morizeyao / Decoders-Chinese-TF2.0
GPT2 training script for Chinese in TensorFlow 2.0
☆149 · Oct 1, 2021 · Updated 4 years ago
wind91725 / gpt2-ml-finetune-
Fine-tune the gpt2-ml Chinese model on your own dataset
☆44 · May 22, 2023 · Updated 3 years ago
TsinghuaAI / CPM-1-Generate
Chinese Pre-Trained Language Models (CPM-LM) Version-I
☆1,579 · Mar 18, 2023 · Updated 3 years ago
brightmart / nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
☆9,904 · Feb 6, 2026 · Updated 5 months ago
thu-coai / CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
☆1,956 · Jun 12, 2023 · Updated 3 years ago
ghosthamlet / gpt2-ml-torch
PyTorch model for https://github.com/imcaspar/gpt2-ml
☆78 · Nov 21, 2021 · Updated 4 years ago
brightmart / albert_zh
A Lite BERT for Self-Supervised Learning of Language Representations; large-scale Chinese pretrained ALBERT models
☆3,979 · Nov 21, 2022 · Updated 3 years ago
brightmart / roberta_zh
Chinese pretrained RoBERTa models: RoBERTa for Chinese
☆2,793 · Jul 22, 2024 · Updated last year
YunwenTechnology / QueryGeneration
☆90 · Jun 20, 2020 · Updated 6 years ago
dbiir / UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,110 · May 9, 2024 · Updated 2 years ago
ymcui / Chinese-XLNet
Pre-Trained Chinese XLNet
☆1,647 · Apr 19, 2026 · Updated 3 months ago
ymcui / Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
☆10,222 · Apr 19, 2026 · Updated 3 months ago
bojone / bert4keras
Keras implementation of transformers for humans
☆5,417 · Nov 11, 2024 · Updated last year
GaoPeng97 / transformer-xl-chinese
An experiment with Transformer-XL for Chinese text generation (can write novels and classical poetry)
☆725 · Apr 7, 2022 · Updated 4 years ago
brightmart / xlnet_zh
Chinese pretrained XLNet model: Pre-Trained Chinese XLNet_Large
☆228 · Sep 13, 2019 · Updated 6 years ago
CLUEbenchmark / CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆4,270 · Feb 6, 2026 · Updated 5 months ago
ChineseGLUE / ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard
☆1,783 · Feb 18, 2023 · Updated 3 years ago
ZhuiyiTechnology / pretrained-models
Open Language Pre-trained Model Zoo
☆1,003 · Nov 18, 2021 · Updated 4 years ago
YunwenTechnology / Unilm
☆442 · Mar 12, 2022 · Updated 4 years ago
huawei-noah / Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162 · Jan 22, 2024 · Updated 2 years ago
liucongg / GPT2-NewsTitle
Chinese news title generation project using GPT2, with very detailed code comments.
☆1,110 · Mar 8, 2022 · Updated 4 years ago
codemayq / chinese-chatbot-corpus
Public Chinese chat corpus
☆4,191 · Apr 23, 2024 · Updated 2 years ago
CLUEbenchmark / CLUEPretrainedModels
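A collection of high-quality Chinese pretrained models: state-of-the-art large models, fastest small models, and specialized similarity models
☆810 · Jul 8, 2020 · Updated 6 years ago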
ymcui / Chinese-ELECTRA
Pre-trained Chinese ELECTRA
☆1,433 · Apr 19, 2026 · Updated 3 months ago
yangjianxin1 / CPM
Easy-to-use CPM for Chinese text generation
☆530 · Apr 10, 2023 · Updated 3 years ago
airaria / TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
☆1,704 · May 8, 2023 · Updated 3 years ago
Embedding / Chinese-Word-Vectors
100+ pretrained Chinese word vectors
☆12,229 · Oct 30, 2023 · Updated 2 years ago
lipiji / Guyu
Chinese GPT2: pre-training and fine-tuning framework for text generation
☆187 · May 24, 2021 · Updated 5 years ago
CLUEbenchmark / CLUECorpus2020
Large-scale Pre-training Corpus for Chinese, 100G
☆1,015 · Feb 6, 2026 · Updated 5 months ago
PaddlePaddle / ERNIE
The official repository for ERNIE 4.5 and ERNIEKit, its industrial-grade development toolkit based on PaddlePaddle.
☆7,722 · Jan 4, 2026 · Updated 6 months ago
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,180 · May 28, 2023 · Updated 3 years ago
InsaneLife / ChineseNLPCorpus
Chinese NLP datasets, material for everyday experiments. Contributions and pull requests are welcome.
☆4,600 · Nov 21, 2023 · Updated 2 years ago
thunlp / OpenCLaP
Open Chinese Language Pre-trained Model Zoo
☆983 · Mar 18, 2020 · Updated 6 years ago
qingkongzhiqian / GPT2-Summary
GPT2-based Chinese summarization model
☆406 · Jul 6, 2023 · Updated 3 years ago
ZhuiyiTechnology / simbert
A BERT for retrieval and generation
☆860 · Feb 26, 2021 · Updated 5 years ago
microsoft / DialoGPT
Large-scale pretraining for dialogue
☆2,422 · Oct 17, 2022 · Updated 3 years ago
Turing-Project / WriteGPT
Developed by 图灵的猫 (Turing's Cat): a first-generation creative AI based on open-source GPT2.0 | extensible, evolvable
☆5,294 · Mar 31, 2024 · Updated 2 years ago