TylunasLi / fastllm

A pure C++, cross-platform LLM acceleration library, callable from Python; supports the chatglm-6B, llama, baichuan, and moss base models on x86 / ARM

☆12

Alternatives and similar repositories for fastllm

Users that are interested in fastllm are comparing it to the libraries listed below

QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆135 Updated 6 months ago
mMrBun / AIPC
☆57 Updated 8 months ago
billvsme / my_openai_api
Deploy your own OpenAI API 🤩, built on flask and transformers (uses the Baichuan2-13B-Chat-4bits model and can run on a single Tesla T4 GPU); implements the OpenAI Chat, Models, and Completions endpoints, including streaming…
☆94 Updated last year
li-xiu-qi / SmartlmageFinder
An image search engine and management system built on multimodal vector models and vision multimodal models, supporting accurate text-to-text, text-to-image, and image-to-image retrieval.
☆42 Updated last week
lilongxian / BaiYang-chatGLM2-6B
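(1) A rotary position embedding encoder with elastic interval normalization plus peft LoRA quantized training, improving support for contexts on the order of ten thousand tokens. (2) Evidence-theory-based interpretable learning, improving the model's complex logical reasoning. (3) Compatible with the alpaca data format.
☆44 Updated last year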
mobvoi / seq-monkey-data
☆146 Updated last year
yongzhuo / ChatGLM2-SFT
ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune
☆108 Updated last year
yongzhuo / qwen2-sft
Qwen1.5-SFT (Alibaba), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers) / LoRA (peft) / inference
☆63 Updated last year
Orion-zhen / roleplay-dataset
A collection of high-quality roleplay conversations
☆13 Updated 6 months ago
MGzhou / modelscope-agent-with-local-llm
This project combines modelscope-agent-v0.3 with api-for-open-llm or llamacpp to implement an AI Agent that uses a local large language model (LLM) to call custom tools. It uses the Qwen1.5 model.
☆17 Updated last year
shell-nlp / gpt_server
gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, and TTS models.
☆194 Updated this week
sophgo / ChatGLM3-TPU
run chatglm3-6b in BM1684X
☆39 Updated last year
llm-factory / imitater
Imitate OpenAI with Local Models
☆86 Updated 10 months ago
amulil / vector_by_onnxmodel
accelerate generating vector by using onnx model
☆17 Updated last year
peilongchencc / docker_tutorial
An introduction to using docker and docker compose.
☆20 Updated 9 months ago
Executedone / ChatGLM4Tools
chatglm-6B for tools application using langchain
☆75 Updated 2 years ago
ssbuild / aigc_data
share data, prompt data, pretraining data
☆36 Updated last year
li-xiu-qi / x-pdf2md
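Built on the PaddlePaddle platform, this project sets up an innovative multi-model collaborative system for efficient, accurate conversion of PDF files to Markdown files.
☆16 Updated 3 months ago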
ArtificialZeng / llama3_explained
The newest version of llama3, with the source code explained line by line in Chinese
☆22 Updated last year
StarRing2022 / MiniRWKV-4
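Implements a Blip2RWKV+QFormer multimodal image-text dialogue model; with the Two-Step Cognitive Psychology Prompt method, a model of only 3B parameters can exhibit human-like causal chains of thought. Benchmarked against image-text dialogue LLMs such as MiniGPT-4 and ImageBind, aiming to use less compute and fewer resources to…
☆38 Updated last year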
limafang / Xtuner-GUI
Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…
☆13 Updated last year
flyme2023 / bge
Scripts for bge inference optimization
☆28 Updated last year
SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆48 Updated last year
open-chinese / alpaca-chinese-dataset
Alpaca Chinese Dataset -- a Chinese instruction fine-tuning dataset
☆208 Updated 8 months ago
ChaimEvans / ChatGLM_MultiGPUCPU_eval
✅ Usable on a 4g GPU | A simple implementation of ChatGLM inference across multiple compute devices (GPU, CPU) on a single machine
☆34 Updated 2 years ago
shibing624 / chatgpt-webui
ChatGPT WebUI using gradio. A simple, easy-to-use Web UI for LLM chat and retrieval-based knowledge Q&A (RAG)
☆130 Updated 10 months ago
v3ucn / llama3-txt2json-dataset-maker
A tool for converting text corpora into training sets; txt to dataset
☆92 Updated last year
Ninot1Quyi / Qwen2.5-Omni-multimodal-chat
A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni; it uses the online API service and supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing.
☆56 Updated last month
shibing624 / agentica
Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents!
☆176 Updated this week
shibing624 / github-hot
Automatically tracks hot Github repos, updated daily
☆49 Updated this week