fengwang / LLaMA-Factory-dockerLinks

☆25

Alternatives and similar repositories for LLaMA-Factory-docker

Users that are interested in LLaMA-Factory-docker are comparing it to the libraries listed below

Sorting:

chu-tianxiang / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆131Updated last year
ouwei2013 / baichuan13b.cpp
ggml implementation of the baichuan13b model (adapted from llama.cpp)
☆55Updated 2 years ago
zzlgreat / smart_agent
☆106Updated 2 years ago
ArtificialZeng / llama3_explained
the newest version of llama3，source code explained line by line using Chinese
☆22Updated last year
ssbuild / qwen_finetuning
qwen models finetuning
☆105Updated 8 months ago
seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
百川Dynamic NTK-ALiBi的代码实现：无需微调即可推理更长文本
☆49Updated 2 years ago
shuyhere / all-about-llm
大语言模型训练和服务调研
☆36Updated 2 years ago
FreedomIntelligence / FastLLM
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆41Updated last year
CLUEbenchmark / SuperCLUE-RAG
中文原生检索增强生成测评基准
☆123Updated last year
shiyemin / light-hf-proxy
A light proxy solution for HuggingFace hub.
☆46Updated 2 years ago
SunLemuria / OpenGPTAndBeyond
Open efforts to implement ChatGPT-like models and beyond.
☆107Updated last year
the-seeds / imitater
Imitate OpenAI with Local Models
☆89Updated last year
OpenCSGs / llm-finetune
The framework of training large language models，support lora, full parameters fine tune etc, define yaml to start training/fine tune of y…
☆30Updated last year
jiahe7ay / infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…
☆58Updated last year
ArtificialZeng / transformers-Explained
官方transformers源码解析。AI大模型时代，pytorch、transformer是新操作系统，其他都是运行在其上面的软件。
☆17Updated 2 years ago
ArtificialZeng / baichuan-speedup
纯c++的全平台llm加速库，支持python调用，支持baichuan, glm, llama, moss基座，手机端流畅运行chatglm-6B级模型单卡可达10000+token / s，
☆45Updated 2 years ago
ssbuild / aigc_data
share data， prompt data , pretraining data
☆36Updated last year
Academic-Hammer / HammerLLM
1.4B sLLM for Chinese and English - HammerLLM🔨
☆43Updated last year
RapidAI / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆68Updated 2 years ago
liguodongiot / unify-easy-llm
unify-easy-llm（ULM）旨在打造一个简易的一键式大模型训练工具，支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。
☆58Updated last year
CLUEbenchmark / SuperCLUE-Industry
中文原生工业测评基准
☆15Updated last year
SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆49Updated last year
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆67Updated 2 years ago
yongzhuo / qwen2-sft
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆68Updated last year
paulxin001 / ChatGLM-sanguo
This project is mainly to explore what effect can be achieved by fine-tuning LLM model (ChatGLM-6B)of about 6B in vertical field (Romance…
☆26Updated 2 years ago
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆178Updated last year
WangRongsheng / Aurora
The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
☆264Updated last year
CyberCommy / baidu-qa-100w
百度QA100万数据集
☆47Updated last year
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆154Updated 2 years ago
zysNLP / quickllm
A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …
☆47Updated last month