pandada8 / llm-inference-benchmark
LLM 推理服务性能测试
☆27Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm-inference-benchmark
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆48Updated 5 months ago
- ☆213Updated 6 months ago
- 📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解☆48Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆124Updated 11 months ago
- Imitate OpenAI with Local Models☆85Updated 2 months ago
- 专注于Python/C++/CUDA、ML/DL/RL和NLP/KG/DS/LLM领域的技术分享。☆63Updated 4 months ago
- 怎么训练一个LLM分词器☆130Updated last year
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆142Updated last year
- A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limi…☆25Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆269Updated 4 months ago
- 中文 大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆169Updated 6 months ago
- 一些大语言模型和多模态模型的应用,主要包括Rag,小模型,Agent,跨模态搜索,OCR等等☆124Updated 2 weeks ago
- 大语言模型指令调优工具(支持 FlashAttention)☆166Updated 10 months ago
- 基于BM25、BGE的检索增强生成RAG示例☆100Updated 3 weeks ago
- ☆85Updated 2 weeks ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆44Updated 6 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆137Updated 2 months ago
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated last year
- ☆82Updated last year
- LongQLoRA: Extent Context Length of LLMs Efficiently☆159Updated last year
- 中文原生检索增强生成 测评基准☆100Updated 7 months ago
- 通义千问的DPO训练☆27Updated 2 months ago
- A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.☆17Updated 2 weeks ago
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案☆139Updated last week
- ☆90Updated last year
- SUS-Chat: Instruction tuning done right☆47Updated 10 months ago
- 文本去重☆67Updated 6 months ago
- 使用单个24G显卡,从0开始训练LLM☆49Updated last month
- NLP 项目记录档案☆43Updated last month
- 大模型多维度中文对齐评测基准 (ACL 2024)☆334Updated 3 months ago