seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBiLinks

百川Dynamic NTK-ALiBi的代码实现：无需微调即可推理更长文本

☆49

Alternatives and similar repositories for baichuan-Dynamic-NTK-ALiBi

Users that are interested in baichuan-Dynamic-NTK-ALiBi are comparing it to the libraries listed below

Sorting:

keezen / ntk_alibi
NTK scaled version of ALiBi position encoding in Transformer.
☆69Updated 2 years ago
LydiaXiaohongLi / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆19Updated 2 years ago
CLUEbenchmark / ZeroCLUE
零样本学习测评基准，中文版
☆57Updated 4 years ago
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆67Updated 2 years ago
MikeGu721 / EasyLLM
make LLM easier to use
☆59Updated 2 years ago
Langboat / mengzi-zero-shot
NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model
☆76Updated 3 years ago
Longyichen / Alpaca-family-library
Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.
☆137Updated 2 years ago
yangzhipeng1108 / DeepSpeed-Chat-ChatGLM
☆44Updated last year
ssbuild / moss_finetuning
moss chat finetuning
☆51Updated last year
thu-coai / OPD
OPD: Chinese Open-Domain Pre-trained Dialogue Model
☆75Updated 2 years ago
CLUEbenchmark / SuperCLUE-Math6
SuperCLUE-Math6：新一代中文原生多轮多步数学推理数据集的探索之旅
☆60Updated last year
llmeval / LLMEval-1
中文大语言模型评测第一期
☆110Updated last year
OpenBMB / DecT
Source code for ACL 2023 paper Decoder Tuning: Efﬁcient Language Understanding as Decoding
☆51Updated 2 years ago
mutonix / RefGPT
☆98Updated last year
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆153Updated 2 years ago
Oneflow-Inc / one-glm
A more efficient GLM implementation!
☆54Updated 2 years ago
taishan1994 / qlora-chinese-LLM
使用qlora对中文大语言模型进行微调，包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE
☆90Updated 2 years ago
ProjectD-AI / LLaMA-Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆68Updated 2 years ago
bitallin / MiduCTC-competition
文本智能校对大赛(Chinese Text Correction)的baseline
☆68Updated 3 years ago
yongzhuo / ChatGLM2-SFT
ChatGLM2-6B微调, SFT/LoRA, instruction finetune
☆110Updated 2 years ago
CSHaitao / ChatGLM_mutli_gpu_tuning
deepspeed+trainer简单高效实现多卡微调大模型
☆129Updated 2 years ago
aplmikex / deduplication_mnbvc
文本去重
☆76Updated last year
genggui001 / Megatron-DeepSpeed-Llama
☆84Updated 2 years ago
basicv8vc / chinese-instruction-datasets-for-llms
用于微调LLM的中文指令数据集
☆28Updated 2 years ago
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆178Updated last year
sufengniu / RefGPT
☆163Updated 2 years ago
georgechen1827 / ChatGLM-text-embedding
use chatGLM to perform text embedding
☆45Updated 2 years ago
Felixgithub2017 / MMCU
MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING
☆89Updated last year
BAAI-WuDao / Data
“悟道”数据
☆49Updated 4 years ago
llmeval / LLMEval-2
中文大语言模型评测第二期
☆71Updated last year