fengwang / LLaMA-Factory-docker
☆25 Updated last year
Alternatives and similar repositories for LLaMA-Factory-docker
Users that are interested in LLaMA-Factory-docker are comparing it to the libraries listed below
- ☆106 Updated 2 years ago
- Imitate OpenAI with Local Models ☆90 Updated last year
- A light proxy solution for HuggingFace hub. ☆49 Updated 2 years ago
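- Deploy your own OpenAI API 🤩, based on flask and transformers (uses the Baichuan2-13B-Chat-4bits model, which can run on a single Tesla T4 GPU); implements the OpenAI Chat, Models, and Completions endpoints, including streaming… ☆96 Updated 2 years ago
- A native Chinese retrieval-augmented generation (RAG) evaluation benchmark ☆124 Updated last year
- unify-easy-llm (ULM) aims to be a simple one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models. ☆60 Updated last year
- A survey of large language model training and serving ☆37 Updated 2 years ago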
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆132 Updated last year
- Alpaca Chinese Dataset -- a Chinese instruction fine-tuning dataset ☆216 Updated last year
- qwen models finetuning ☆106 Updated 10 months ago
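- A pure C++ cross-platform LLM acceleration library, callable from Python; supports baichuan, glm, llama, and moss base models; runs smoothly on mobile devices; chatglm-6B-class models can reach 10000+ tokens/s on a single GPU ☆45 Updated 2 years ago
- ChatGLM-6B-Slim: ChatGLM-6B with the 20K image tokens pruned; identical performance with a smaller GPU memory footprint. ☆126 Updated 2 years ago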
- Open efforts to implement ChatGPT-like models and beyond. ☆107 Updated last year
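- (1) A rotary position embedding encoder with elastic-interval normalization plus peft LoRA quantized training, improving performance at the 10,000-token scale. (2) Evidence-theory explanation learning to improve the model's complex logical reasoning. (3) Compatible with the alpaca data format. ☆45 Updated 2 years ago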
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆140 Updated last year
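- A walkthrough of the official transformers source code. In the era of large AI models, pytorch and transformer are the new operating system, and everything else is software running on top of them. ☆17 Updated 2 years ago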
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning" ☆264 Updated last year
- Code implementation of Dynamic NTK-ALiBi for Baichuan: inference on longer texts without fine-tuning ☆49 Updated 2 years ago
- A repo for updating and debugging Mixtral-7x8B, MOE, ChatGLM3, LLaMa2, BaChuan, Qwen and other LLM models, including new models mixtral, mixtral 8x7b, … ☆47 Updated 4 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. ☆68 Updated 2 years ago
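- A demo of vllm's remarkable effect on Chinese large language models ☆31 Updated 2 years ago
- An instruction-tuning tool for large language models (supports FlashAttention) ☆177 Updated 2 years ago
- The first Chinese llama2 13b model (Base + Chinese dialogue SFT, enabling fluent multi-turn natural-language human-machine interaction) ☆91 Updated 2 years ago
- Aims to provide an intuitive, concrete, and standardized evaluation of current mainstream LLMs ☆95 Updated 2 years ago
- Large language model fine-tuning for bloom, opt, gpt, gpt2, llama, llama-2, cpmant, and so on ☆99 Updated last year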
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆39 Updated last year
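- A demo for the AIOPS24 challenge ☆64 Updated last year
- (Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training scenario to guide readers through hands-on LLM secondary fine-tuning. ☆36 Updated last year
- Large language model training in 3 stages + deployment ☆49 Updated 2 years ago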
- ggml implementation of the baichuan13b model (adapted from llama.cpp) ☆55 Updated 2 years ago