shell-nlp / gpt_server
gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.
☆212Updated last week
Alternatives and similar repositories for gpt_server
Users who are interested in gpt_server are comparing it to the libraries listed below.
- An easy-to-use framework for modular RAG☆393Updated this week
- Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset☆214Updated 11 months ago
- Deploy your own OpenAI API🤩, based on flask and transformers (uses the Baichuan2-13B-Chat-4bits model and can run on a single Tesla T4 GPU); implements the OpenAI Chat, Models, and Completions endpoints, including streaming…☆96Updated last year
- Chinese LLM fine-tuning (LLM-SFT), math instruction dataset MWP-Instruct, supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), supports (LoRA, QLoRA, DeepSpeed, UI, TensorboardX), supports (…☆212Updated last year
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents!☆209Updated this week
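- Use free LLM APIs together with your private-domain data to generate SFT training data (entirely for free); supports the training data formats of tools such as llamafactory (synthetic data)☆185Updated 10 months ago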
- ☆354Updated last year
- ChatGPT WebUI using gradio. Provides a simple, easy-to-use web UI for LLM chat and retrieval-based knowledge Q&A (RAG)☆135Updated last year
- 360LayoutAnalysis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute☆302Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆137Updated 9 months ago
- Exploring the potential of LLM applications in the legal industry☆91Updated 9 months ago
- Examples of retrieval-augmented generation (RAG) based on large language models☆158Updated 4 months ago
- Adds a 🚀 streaming web service to GraphRAG, compatible with the OpenAI SDK; supports accessible entity links 🔗, suggested questions, and local embedding models; fixes many issues.☆262Updated 6 months ago
- Q&A over your own knowledge base, based on elasticsearch+langchain+chatglm2☆242Updated 2 years ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆175Updated last week
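- An application example of GraphRAG. The project provides a way to replace the OpenAI models and, by modifying the original prompts and the document-splitting method, improves GraphRAG's ability to handle Chinese content.☆168Updated 11 months ago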
- A native Chinese benchmark for evaluating retrieval-augmented generation☆122Updated last year
- Cleaning and quality evaluation of Chinese corpora for large model pre-training☆70Updated last year
- ☆268Updated 9 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆120Updated 3 months ago
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆608Updated last year
- dify's rag patch module☆275Updated 3 weeks ago
- code for piccolo embedding model from SenseTime☆140Updated last year
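- YAYI information extraction LLM: instruction-tuned on million-scale, manually constructed, high-quality information extraction data; developed by the Zhongke Wenge algorithm team. (Repo for YAYI Unified Information Extraction Model)☆310Updated last year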
- Archive of NLP project records☆58Updated 5 months ago
- qwen models finetuning☆103Updated 6 months ago
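- This project runs evaluation experiments on recall techniques and algorithm performance for the Retrieve stage of RAG. The main framework used is LlamaIndex.☆278Updated 2 months ago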
- Analysis of Chinese and English layouts☆246Updated last month
- QA based on local knowledge and LLM.☆237Updated 8 months ago
- Convert files into markdown to help RAG pipelines or LLMs understand them; based on markitdown and MinerU, which provide high-quality PDF parsing.☆128Updated 6 months ago