A pure C++ cross-platform LLM acceleration library with Python bindings; supports Baichuan, GLM, LLaMA, and MOSS base models. Runs ChatGLM-6B-class models smoothly on mobile; a single GPU can reach 10,000+ tokens/s.
☆42 · Aug 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for baichuan-speedup
Users that are interested in baichuan-speedup are comparing it to the libraries listed below.
- Implements Baichuan-Chat fine-tuning with LoRA, QLoRA, and other methods; one-click run. ☆70 · Aug 15, 2023 · Updated 2 years ago
- ggml implementation of the baichuan13b model (adapted from llama.cpp) ☆55 · Jul 27, 2023 · Updated 2 years ago
- PFCC community blog ☆14 · Apr 12, 2026 · Updated last week
- An open-source multimodal large language model based on baichuan-7b ☆72 · Dec 7, 2023 · Updated 2 years ago
- A demo of vllm's remarkable performance on Chinese large language models ☆31 · Nov 4, 2023 · Updated 2 years ago
- An STM32-based car implementing five basic functions: line following, infrared obstacle avoidance, ultrasonic obstacle avoidance, Bluetooth remote control, and infrared remote control ☆15 · May 25, 2024 · Updated last year
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces… ☆15 · Jun 15, 2023 · Updated 2 years ago
- Safety helmet detection built with darknet compiled on Windows 10, plus a simple MFC demo that runs object detection with the generated model. Safety Helmet Wearing Test ☆24 · Nov 7, 2019 · Updated 6 years ago
- Transformer-related optimization, including BERT, GPT ☆17 · Jul 29, 2023 · Updated 2 years ago
- ChatGLM2-6B-Explained ☆36 · Jul 28, 2023 · Updated 2 years ago
- Adds an RLHF implementation to ChatGLM-6B, with line-by-line explanations of the core code; the examples cover short news-headline generation and RLHF for context-conditioned recommendation ☆88 · Aug 16, 2023 · Updated 2 years ago
- Generates the large amounts of data needed by text error-correction models such as GECToR. ☆14 · Jan 5, 2023 · Updated 3 years ago
- Resources for Large Language Model Inference ☆17 · Dec 29, 2023 · Updated 2 years ago
- An Android sample using NativeActivity with an OpenGL ES and EGL engine ☆17 · Jul 8, 2017 · Updated 8 years ago
- CHATGPT-In-Jupyter ☆11 · Jun 2, 2023 · Updated 2 years ago
- PPDAI "Magic Mirror Cup" risk-control competition ☆12 · Dec 22, 2016 · Updated 9 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 · Jul 21, 2023 · Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex… ☆15 · Nov 25, 2023 · Updated 2 years ago
- CUDA-version image processing API ☆40 · Mar 17, 2019 · Updated 7 years ago
- PICA, a multi-turn empathetic dialogue model ☆98 · Sep 11, 2023 · Updated 2 years ago
- A lightweight deep learning framework for CSK60XX serial products ☆25 · Apr 3, 2026 · Updated 2 weeks ago
- High-level Rust library that binds to Poppler to extract text from a PDF ☆11 · Dec 16, 2020 · Updated 5 years ago
- "Financial Brain" financial-intelligence NLP service competition ☆17 · Apr 27, 2019 · Updated 6 years ago
- ☆15 · Jan 11, 2023 · Updated 3 years ago
- 1st-place solution for the Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc ☆159 · Jul 25, 2025 · Updated 8 months ago
- This project provides a face recognition system via OpenCV 4 ☆18 · Jan 16, 2019 · Updated 7 years ago
- Implements Dive into Deep Learning (《动手学深度学习》) with tensorflow.keras ☆10 · Jan 21, 2020 · Updated 6 years ago
- AIxCC: automated vulnerability repair via LLMs, search, and static analysis ☆12 · Jul 16, 2024 · Updated last year
- Examples of demo deployment using Gradio: image classification, live webcam segmentation, APIs, tunneling, etc. ☆17 · Oct 17, 2022 · Updated 3 years ago
- Create RP training data from a VN, using GPT-4 ☆18 · Nov 2, 2023 · Updated 2 years ago
- Modified from the official deepstream-test1 sample to pull an RTSP camera stream, run inference, and display the results ☆15 · Mar 11, 2020 · Updated 6 years ago
- C++ and CUDA extensions for Python/PyTorch and GPU-accelerated augmentation. ☆35 · Nov 30, 2022 · Updated 3 years ago
- Baidu QA dataset of 1,000,000 entries ☆45 · Nov 30, 2023 · Updated 2 years ago
- fastllm is a high-performance LLM inference library with no backend dependencies. It supports tensor-parallel inference for dense models and mixed-mode inference for MoE models; any GPU with 10 GB+ VRAM can run the full DeepSeek model. A dual-socket 9004/9005 server with a single GPU serves the full-precision original DeepSeek model at 20 tps single-stream; the INT4-quantized model reaches 30 tp… ☆4,189 · Apr 10, 2026 · Updated last week
- ☆11 · May 15, 2019 · Updated 6 years ago
- A small object-oriented learning project: a student information management system ☆10 · Oct 6, 2019 · Updated 6 years ago
- A 13B large language model developed by Baichuan Intelligent Technology ☆2,933 · Sep 6, 2023 · Updated 2 years ago
- An experimental project for paddle python IR. ☆15 · Dec 4, 2023 · Updated 2 years ago
- Accelerate generating vectors by using an ONNX model ☆18 · Jan 23, 2024 · Updated 2 years ago