shell-nlp / gpt_server
gpt_server是一个用于生产级部署LLMs或Embedding的开源框架。
☆158Updated this week
Alternatives and similar repositories for gpt_server:
Users that are interested in gpt_server are comparing it to the libraries listed below
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支 持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆240Updated 2 months ago
- GraphRAG的应用实例,项目特点在于提供了替换OpenAI模型的方法,并通过修改原有提示和切分文档的方法,提高了GraphRAG处理中文内容的能力。☆125Updated 4 months ago
- 基于大语言模型的检索增强生成RAG示例☆130Updated 3 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 轻松构建智能、具备反思能力、可协作的多模态AI Agent。☆142Updated this week
- dify's rag patch module☆172Updated last month
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆120Updated 8 months ago
- ☆214Updated 2 months ago
- A collection of RAG systems powered by LLM.☆164Updated this week
- Alpaca Chinese Dataset -- 中文指令微调数据集☆193Updated 5 months ago
- 本项目是针对RAG中的Retrieve阶段的召回技术及算法效果所做评估实验。使用主体框架为LlamaIndex.☆223Updated 2 months ago
- 中文原生检索增强生成测评基准☆112Updated 10 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆267Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated 3 months ago
- ☆60Updated 4 months ago
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆90Updated last year
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆186Updated 9 months ago
- ☆63Updated 5 months ago
- An easy-to-use framework for modular RAG☆336Updated this week
- 大模型检索增强生成技术最佳实践。☆66Updated 6 months ago
- 国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)☆114Updated last year
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆117Updated 6 months ago
- ☆310Updated 9 months ago
- Imitate OpenAI with Local Models☆87Updated 6 months ago
- Q&A based on elasticsearch+langchain+chatglm2 | 基于elasticsearch,langchain,chatglm2的自有知识库问答☆235Updated last year
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆150Updated 3 months ago
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案☆355Updated 3 months ago
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆248Updated last year
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆290Updated this week
- 基于开源embedding模型的中文向量效果测试☆132Updated last year
- qwen models finetuning☆91Updated this week