K024 / chatglm-q

Another ChatGLM2 implementation for GPTQ quantization

☆54

Alternatives and similar repositories for chatglm-q

Users that are interested in chatglm-q are comparing it to the libraries listed below

Oneflow-Inc / one-glm
A more efficient GLM implementation!
☆55 Updated 2 years ago
vxfla / kanchil
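Kanchil (the mouse-deer) is the world's smallest even-toed ungulate. This open-source project explores whether small models (under 6B parameters) can also be aligned with human preferences.
☆112 Updated 2 years ago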
ProjectD-AI / llama_inference
llama inference for tencentpretrain
☆98 Updated 2 years ago
LC1332 / Luotuo-Silk-Road
Silk Road will be the dataset zoo for Luotuo (骆驼). Luotuo is an open-source Chinese LLM project founded by Chen Qiyuan (陈启源) @ Central China Normal University & Li Lulu (李鲁鲁) @ SenseTime & 冷子…
☆39 Updated last year
StarRing2022 / ChatGPTX-Uni
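Implements a cross-model technical scheme that combines multi-LoRA weight integration and switching with Zero-Finetune (zero fine-tuning) enhancement: LLM-Base + LLM-X + Alpaca. Initially, LLM-Base is the Chatglm6B base model and LLM-X is the LLAMA enhancement model. The scheme is simple and efficient; the goal is to let such language models be deployed widely with low energy consumption, and…
☆116 Updated 2 years ago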
seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning
☆47 Updated last year
taishan1994 / qlora-chinese-LLM
Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE
☆87 Updated 2 years ago
ArtificialZeng / baichuan-speedup
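A pure C++ cross-platform LLM acceleration library, callable from Python; supports baichuan, glm, llama, and moss base models; runs smoothly on mobile devices; chatglm-6B-class models can reach 10000+ tokens/s on a single GPU
☆45 Updated last year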
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆136 Updated 7 months ago
ssbuild / deep_training
deep learning
☆148 Updated 2 months ago
silverriver / ChatGLM-6B-Slim
ChatGLM-6B-Slim: ChatGLM-6B with the 20K image tokens pruned; identical performance with a smaller GPU memory footprint.
☆126 Updated 2 years ago
llm-factory / imitater
Imitate OpenAI with Local Models
☆87 Updated 11 months ago
IDEA-CCNL / GTS-Engine
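GTS Engine: A powerful NLU Training System. GTS-Engine is an out-of-the-box, high-performance natural language understanding engine focused on few-shot tasks; it can automatically produce NLP models from only a small number of samples.
☆91 Updated 2 years ago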
Tlntin / ChatGLM2-6B-TensorRT
☆90 Updated 2 years ago
alanshi / charset_mnbvc
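This project aims to perform fast encoding detection and conversion on large numbers of text files, to support data cleaning for the mnbvc corpus project
☆61 Updated 9 months ago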
zejunwang1 / LLMTuner
Instruction-tuning tool for large language models (supports FlashAttention)
☆175 Updated last year
lilongxian / BaiYang-chatGLM2-6B
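(1) A rotary position embedding encoder with elastic-interval normalization plus peft LoRA quantized training, improving support for inputs on the order of ten thousand tokens. (2) Evidence-theory explanation learning, improving the model's complex logical reasoning ability. (3) Compatible with the alpaca data format.
☆44 Updated 2 years ago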
OpenSenseNova / piccolo-embedding
code for piccolo embedding model from SenseTime
☆134 Updated last year
aplmikex / deduplication_mnbvc
Text deduplication
☆75 Updated last year
ssbuild / qwen_finetuning
qwen models finetuning
☆101 Updated 4 months ago
LydiaXiaohongLi / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆19 Updated 2 years ago
yongzhuo / chatglm-maths
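chatglm-6b fine-tuning/LoRA/PPO/inference; samples are automatically generated integer/decimal addition, subtraction, multiplication, and division problems; runs on GPU or CPU
☆164 Updated last year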
ziwang-com / zero-lora
zero: zero-training LLM parameter tuning
☆31 Updated 2 years ago
ArtificialZeng / llama3_explained
The newest version of llama3, with source code explained line by line in Chinese
☆22 Updated last year
SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆49 Updated last year
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆66 Updated 2 years ago
ouwei2013 / baichuan13b.cpp
ggml implementation of the baichuan13b model (adapted from llama.cpp)
☆54 Updated 2 years ago
yangjianxin1 / LLMPruner
☆307 Updated 2 years ago
yongzhuo / ChatGLM2-SFT
ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune
☆108 Updated 2 years ago
georgechen1827 / ChatGLM-text-embedding
Use ChatGLM to perform text embedding
☆45 Updated 2 years ago