xubuvd/LLMs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xubuvd/LLMs)

xubuvd / LLMs

专注于中文领域大语言模型，落地到某个行业某个领域，成为一个行业大模型、公司级别或行业级别领域大模型。

☆125

Alternatives and similar repositories for LLMs

Users that are interested in LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sunzeyeah / RLHF
View on GitHub
Implementation of Chinese ChatGPT
☆287Nov 20, 2023Updated 2 years ago
carbonz0 / alpaca-chinese-dataset
View on GitHub
alpaca中文指令微调数据集
☆395Mar 26, 2023Updated 3 years ago
shibing624 / lmft
View on GitHub
ChatGLM-6B fine-tuning.
☆135Apr 25, 2023Updated 3 years ago
Chinese-Tiny-LLM / Chinese-Tiny-LLM
View on GitHub
☆237May 10, 2024Updated 2 years ago
morning-hao / domain-self-instruct
View on GitHub
受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果，通过GPT获得question和answer来作为训练数据
☆18May 12, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yuanzhoulvpi2017 / zero_nlp
View on GitHub
中文nlp解决方案(大模型、数据、模型、训练、推理)
☆3,830Aug 5, 2025Updated 11 months ago
ssbuild / aigc_evals
View on GitHub
aigc evals
☆10Dec 2, 2023Updated 2 years ago
JovenChu / FasterTransformer_Bert
View on GitHub
Using FasterTransformer for accelerating the predict speed of bert and roberta
☆14Sep 20, 2019Updated 6 years ago
CASIA-LM / MoDS
View on GitHub
☆153Apr 16, 2024Updated 2 years ago
yangjianxin1 / Firefly
View on GitHub
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…
☆6,646Oct 24, 2024Updated last year
pp1230 / LLMGPUMemEstimator
View on GitHub
The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.
☆35Apr 9, 2024Updated 2 years ago
yongzhuo / chatglm-maths
View on GitHub
chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu
☆165Aug 24, 2023Updated 2 years ago
ssbuild / moss_finetuning
View on GitHub
moss chat finetuning
☆51Apr 23, 2024Updated 2 years ago
FreedomIntelligence / FastLLM
View on GitHub
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆41Jan 4, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
StarRing2022 / ChatGPTX-Uni
View on GitHub
实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案，LLM-Base+LLM-X+Alpaca，初期，LLM-Base为Chatglm6B底座模型，LLM-X是LLAMA增强模型。该方案简易高效，目标是使此类语言模型能够低能耗广泛部署，并最…
☆114Jul 19, 2023Updated 3 years ago
ssbuild / chatglm_finetuning
View on GitHub
chatglm 6b finetuning and alpaca finetuning
☆1,528Mar 9, 2025Updated last year
shibing624 / nerpy
View on GitHub
🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具，支持BertSoftmax、BertSpan等模型，开箱即用。
☆118Feb 19, 2024Updated 2 years ago
HIT-SCIR / Chinese-Mixtral-8x7B
View on GitHub
中文Mixtral-8x7B（Chinese-Mixtral-8x7B）
☆651Aug 17, 2024Updated last year
shibing624 / textgen
View on GitHub
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型，实现了包括LLaMA，ChatGLM，BLO…
☆981Sep 14, 2024Updated last year
zhanshijinwat / Steel-LLM
View on GitHub
Train a 1B LLM with 1T tokens from scratch by personal
☆810Apr 27, 2025Updated last year
hikariming / chat-dataset-baseline
View on GitHub
人工精调的中文对话数据集和一段chatglm的微调代码
☆1,190May 3, 2025Updated last year
StarRing2022 / MiniRWKV-4
View on GitHub
实现Blip2RWKV+QFormer的多模态图文对话大模型，使用Two-Step Cognitive Psychology Prompt方法，仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4，ImageBind等图文对话大语言模型，力求以更小的算力和资源实…
☆42Jul 17, 2023Updated 3 years ago
IDEA-CCNL / Ziya-Coding
View on GitHub
☆15Oct 9, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JunnYu / ChineseBert_pytorch
View on GitHub
huggingface ChineseBert Tokenizer
☆16Apr 16, 2022Updated 4 years ago
dandelionsllm / pandallm
View on GitHub
Panda项目是于2023年5月启动的开源海外中文大语言模型项目，致力于大模型时代探索整个技术栈，旨在推动中文自然语言处理领域的创新和合作。
☆1,032Oct 19, 2023Updated 2 years ago
yanqiangmiffy / InstructGLM
View on GitHub
ChatGLM-6B 指令学习|指令数据|Instruct
☆651Apr 10, 2023Updated 3 years ago
liangwq / LLM_StableDiffusion_Studio
View on GitHub
smart chinese LLm
☆19Jan 31, 2024Updated 2 years ago
shibing624 / MedicalGPT
View on GitHub
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
☆5,636Jun 3, 2026Updated last month
OpenLLMAI / OpenLLMWiki
View on GitHub
OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi…
☆269Dec 10, 2024Updated last year
ssbuild / pytorch-task-example
View on GitHub
deep training task
☆30Apr 28, 2023Updated 3 years ago
LianjiaTech / BELLE
View on GitHub
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
☆8,273Oct 16, 2024Updated last year
feizc / Visual-ChatGLM
View on GitHub
Open ChatGLM Eyes to See the World
☆13Mar 30, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
paulxin001 / ChatGLM-sanguo
View on GitHub
This project is mainly to explore what effect can be achieved by fine-tuning LLM model (ChatGLM-6B)of about 6B in vertical field (Romance…
☆26Apr 6, 2023Updated 3 years ago
SunDoge / bytepiece-rs
View on GitHub
更纯粹、更高压缩率的Tokenizer in Rust
☆14Dec 21, 2024Updated last year
mymusise / ChatGLM-Tuning
View on GitHub
基于ChatGLM-6B + LoRA的Fintune方案
☆3,744Nov 25, 2023Updated 2 years ago
PhoebusSi / Alpaca-CoT
View on GitHub
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…
☆2,791Dec 12, 2023Updated 2 years ago
PRIS-CV / MSSRM
View on GitHub
An implementation of MSSRM method
☆10Mar 23, 2023Updated 3 years ago
keezen / ntk_alibi
View on GitHub
NTK scaled version of ALiBi position encoding in Transformer.
☆69Aug 16, 2023Updated 2 years ago
Dustyposa / rasa_bot_front
View on GitHub
基于 rasa 的机器人的前端项目
☆41Apr 27, 2022Updated 4 years ago