TylunasLi / fastllmLinks
纯c++的全平台llm加速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM
☆12Updated this week
Alternatives and similar repositories for fastllm
Users that are interested in fastllm are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆135Updated 6 months ago
- ☆57Updated 8 months ago
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆94Updated last year
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆42Updated last week
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆44Updated last year
- ☆146Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆108Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆63Updated last year
- 收集优质的角色扮演聊天数据 | Collection of roleplay conversations of high quality☆13Updated 6 months ago
- 本项目基于modelscope-agent-v0.3和 api-for-open-llm 或 llamacpp 组件共同实现了一个AI Agent,能够利用本地的大模型(LLM)实现使用自定义工具功能。使用了Qwen1.5大模型。☆17Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆194Updated this week
- run chatglm3-6b in BM1684X☆39Updated last year
- Imitate OpenAI with Local Models☆86Updated 10 months ago
- accelerate generating vector by using onnx model☆17Updated last year
- 介绍docker、docker compose的使用。☆20Updated 9 months ago
- chatglm-6B for tools application using langchain☆75Updated 2 years ago
- share data, prompt data , pretraining data☆36Updated last year
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆16Updated 3 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆38Updated last year
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- bge推理优化相关脚本☆28Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- Alpaca Chinese Dataset -- 中文指令微调数据集☆208Updated 8 months ago
- ✅4g GPU可用 | 简易实现ChatGLM单机调用多个计算设备(GPU、CPU)进行推理☆34Updated 2 years ago
- ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面☆130Updated 10 months ago
- 文本语料转训练集工具,txt转dataset☆92Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆56Updated last month
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构 建智能的多模态AI Agent。☆176Updated this week
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week