zRzRzRzRzRzRzR / lm-fly
Accelerating large-model inference frameworks so that LLMs can fly
☆20Updated last year
Alternatives and similar repositories for lm-fly
Users that are interested in lm-fly are comparing it to the libraries listed below
- Agent tasks designed on LangChain, covering planning of conversation-scenario resources, subtask construction, and a task executor that includes MCTS☆30Updated 2 weeks ago
- Examples for QinYan GLMs☆13Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆54Updated last week
- We are the first fully commercially usable role-playing large model.☆40Updated last year
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images. An AI-based content conversion…☆32Updated 3 months ago
- GLM Series Edge Models☆151Updated 4 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆267Updated 2 months ago
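- Built on the PaddlePaddle platform, this project sets up an innovative multi-model collaborative system for efficient, accurate conversion of PDF files to Markdown files.☆27Updated 7 months ago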
- Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.☆185Updated last week
- 😜 A visual dataset of memes, annotated using the image-understanding capabilities of glm-4v and step-1v.☆138Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆62Updated this week
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆249Updated last year
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec…☆201Updated 3 weeks ago
- LLM-related material, including code and docs☆13Updated 8 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆293Updated 4 months ago
- Intelligent LLM routing gateway; Enterprise Intelligent AI-API Distribution Gateway☆13Updated 9 months ago
- A fluent, scalable, and easy-to-use LLM data processing framework.☆26Updated 3 months ago
- ☆16Updated last year
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- ☆164Updated last year
- gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image-editing, and text-to-video models.☆216Updated this week
- LLM Inference benchmark☆428Updated last year
- As the name suggests: a hand-built RAG☆130Updated last year
- LLM101n: Let's build a Storyteller (Chinese edition)☆135Updated last year
- Let's raise an AI cat with its own dedicated memory!☆10Updated last year
- An archive of NLP project records☆60Updated 6 months ago
- AI Emoji Argue Agent 🚀 An open-source, LangChain-based agent for meme battles☆26Updated last year
- Imitate OpenAI with Local Models☆88Updated last year
- A simple, just-right LLM application framework that lets you integrate LLM capabilities seamlessly in the most "Code Center" way. LLM As Function, Prompt As Code☆69Updated this week
- You can play with any API server that is compatible with the OpenAI API☆24Updated last year