shell-nlp / gpt_serverLinks
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。
☆244Updated last week
Alternatives and similar repositories for gpt_server
Users that are interested in gpt_server are comparing it to the libraries listed below
Sorting:
- An easy-to-use framework for modular RAG☆432Updated this week
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆244Updated this week
- Alpaca Chinese Dataset -- 中文指令微调数据集☆216Updated last year
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆194Updated last year
- ☆274Updated last year
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆263Updated 10 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆125Updated 7 months ago
- 基于大语言模型的检索增强生成RAG示例☆168Updated 9 months ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆194Updated last month
- ☆58Updated last year
- GraphRAG的应用实例,项目特点在于提供了替换OpenAI模型的方法,并通过修改原有提示和切分文档的方法,提高了GraphRAG处理中文内容的能力。☆185Updated last year
- 探索 LLM 在法律行业的应用潜力☆96Updated last year
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆139Updated last year
- Q&A based on elasticsearch+langchain+chatglm2 | 基于elasticsearch,langchain,chatglm2的自有知识库问答☆243Updated 2 years ago
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆96Updated 2 years ago
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆216Updated last year
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆132Updated 10 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆305Updated last year
- ☆363Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆60Updated last year
- 本项目是针对RAG中的Retrieve阶段的召回技术及算法效果所做评估实验。使用主体框架为LlamaIndex.☆292Updated 6 months ago
- dify's rag patch module☆277Updated 5 months ago
- 中文原生检索增强生成测评基准☆124Updated last year
- 大模型检索增强生成技术最佳实践。☆88Updated last year
- 一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索☆523Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆39Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆140Updated last year
- 基于开源embedding模型的中文向量效果测试☆148Updated 2 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆260Updated this week
- bge推理优化相关脚本☆29Updated 2 years ago