zhenlohuang/awesome-chinese-llm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhenlohuang/awesome-chinese-llm)

zhenlohuang / awesome-chinese-llm

Awesome Chinese LLM: A curated list of Chinese Large Language Model 中文大语言模型数据集和模型资料汇总

☆167

Alternatives and similar repositories for awesome-chinese-llm

Users that are interested in awesome-chinese-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

butyuhao / Awesome-Chinese-LLM
View on GitHub
☆12Mar 29, 2023Updated 3 years ago
AiHubCN / Awesome-Chinese-LLM
View on GitHub
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
☆22,708May 10, 2026Updated 2 months ago
SigmaQuan / Awesome-Chinese-Corpus-Datasets-and-Models
View on GitHub
Awesome Chinese Corpus Datasets and Models.
☆19Oct 28, 2019Updated 6 years ago
lonePatient / awesome-pretrained-chinese-nlp-models
View on GitHub
Awesome Pretrained Chinese NLP Models，高质量中文预训练模型&大模型&多模态模型&大语言模型集合
☆5,575Jun 19, 2026Updated last month
esbatmop / MNBVC
View on GitHub
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志…
☆4,246Jul 13, 2026Updated 2 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
YuejiaoGong / HierETA
View on GitHub
Code for the KDD 2022 paper "Interpreting Trajectories from Multiple Views: A Hierarchical Self-Attention Network for Estimating the Time…
☆18May 29, 2022Updated 4 years ago
v-mipeng / ClassImbalanceLearning
View on GitHub
This is the implementation code for the paper "Trainable Undersampling for Class-Imbalance Learning" published in AAAI2019
☆15Mar 17, 2019Updated 7 years ago
yongzhuo / MacroGPT-Pretrain
View on GitHub
macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor
☆15Nov 30, 2023Updated 2 years ago
vba34520 / Rasa-UI
View on GitHub
A simple Rasa UI
☆14Jul 13, 2020Updated 6 years ago
Hv0nnus / MLOT
View on GitHub
This is the repository to reproduce the experiments of the IJCAI 2020 paper "Metric Learning in Optimal Transport for Domain Adaptation"
☆23Jun 9, 2020Updated 6 years ago
CLUEbenchmark / CLUECorpus2020
View on GitHub
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
☆1,017Feb 6, 2026Updated 5 months ago
Ailln / sentiment-analysis
View on GitHub
😄😐😠 情感分析（使用 emoji 可视化）
☆10Sep 5, 2021Updated 4 years ago
LightR0 / hugging_face_tutorials
View on GitHub
☆18Apr 28, 2022Updated 4 years ago
Hannibal046 / GPT-OSS-BrowseCompPlus-Eval
View on GitHub
Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools
☆20Oct 17, 2025Updated 9 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
layumi / ICME2022SS
View on GitHub
ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”
☆12Jun 3, 2024Updated 2 years ago
junhyukk / MAMNet-Tensorflow
View on GitHub
Tensorflow implementation of MAMNet
☆10Apr 2, 2020Updated 6 years ago
YibinShen / TTPNet
View on GitHub
Travel Time Prediction Based on Tensor Decomposition and Graph Embedding
☆29Dec 25, 2020Updated 5 years ago
serfer2 / python-deepweb
View on GitHub
Short demo about how to manage TOR network agent with Python (stem)
☆10Feb 20, 2019Updated 7 years ago
handayu / MC-Fundament-Code
View on GitHub
关于Multicharts程序化交易的基础代码(画图,交易,打印输出等)
☆10May 21, 2019Updated 7 years ago
GDGVIT / onion-crawler
View on GitHub
☆13Nov 7, 2020Updated 5 years ago
biodog / dark_web_spider
View on GitHub
暗网爬虫,暗网交易市场爬虫
☆12Sep 28, 2021Updated 4 years ago
xv44586 / Chinese-instruction-datasets
View on GitHub
中文 Instruction tuning datasets
☆143Apr 10, 2024Updated 2 years ago
joshfaust / Onion-Hunter
View on GitHub
Hunt and Analyze Tor Onion Sites
☆25Dec 8, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
dfederschmidt / pyliwc
View on GitHub
LIWC (Linguistic Inquiry and Word Count) in Python
☆10Dec 8, 2022Updated 3 years ago
KahimWong / ADCD-Net
View on GitHub
[ICCV'25] ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement
☆26Mar 29, 2026Updated 4 months ago
openfeedback / superhf
View on GitHub
Open-source Human Feedback Library
☆11Oct 25, 2023Updated 2 years ago
codejitsu / session4rec
View on GitHub
GRu4Rec in TensorFlow
☆14Apr 11, 2018Updated 8 years ago
wshuyi / demo_text_binary_classification_bert
View on GitHub
☆11Apr 8, 2019Updated 7 years ago
Eighonet / GCT-TTE
View on GitHub
GCT-TTE algorithm & dedicated application
☆10Jan 14, 2024Updated 2 years ago
taishan1994 / pytorch_uie_re
View on GitHub
基于百度uie的关系抽取
☆20Sep 26, 2022Updated 3 years ago
LianjiaTech / BELLE
View on GitHub
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
☆8,279Oct 16, 2024Updated last year
km1994 / nlp_paper_study_search_engine
View on GitHub
该仓库主要记录 NLP 算法工程师相关的搜索引擎学习笔记
☆14Apr 9, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yangjianxin1 / Firefly
View on GitHub
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…
☆6,649Oct 24, 2024Updated last year
yuanxiaosc / Text-generation-task-and-language-model-GPT2
View on GitHub
solve text generation tasks by the language model GPT2, including papers, code, demo demos, and hands-on tutorials. 使用语言模型GPT2来解决文本生成任务的…
☆26Aug 27, 2019Updated 6 years ago
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
val-iisc / BPFC
View on GitHub
Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes
☆23Jun 14, 2020Updated 6 years ago
yongzhuo / pytorch-loss
View on GitHub
pytorch版损失函数，改写自科学空间文章，【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】
☆12Aug 22, 2021Updated 4 years ago
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago