zRzRzRzRzRzRzR / lm-fly
大模型推理框架加速,让 LLM 飞起来
☆19Updated 10 months ago
Alternatives and similar repositories for lm-fly:
Users that are interested in lm-fly are comparing it to the libraries listed below
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆48Updated this week
- Examples for QinYan GLMs☆11Updated 6 months ago
- 使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略☆29Updated this week
- 我们是第一个完全可商用的 角色大模型。☆39Updated 7 months ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆115Updated 10 months ago
- GLM Series Edge Models☆130Updated 3 weeks ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆237Updated this week
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆40Updated 4 months ago
- AI Emoji Argue Agent 🚀 基于LangChain的开源表情包斗图Agent☆25Updated 9 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 9 months ago
- ☆16Updated 8 months ago
- 纯c++的全平台llm加速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM☆9Updated last month
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆65Updated 8 months ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen …☆13Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆45Updated 2 months ago
- Imitate OpenAI with Local Models☆87Updated 6 months ago
- A simple way to synthesize LLM training data. (under construction⚠)☆16Updated 2 weeks ago
- You can play any API server that compatible with OpenAI API☆23Updated 9 months ago
- 🌱 将智谱清言官方智能体API转换为OpenAI兼容协议的网关 👋 帮助开发者们降低接入API的门槛☆44Updated 10 months ago
- LLM101n: Let's build a Storyteller 中文版☆127Updated 6 months ago
- ☆134Updated 9 months ago
- Just for debug☆56Updated last year
- ☆19Updated 8 months ago
- Easy-GPT4O opensource version☆74Updated 9 months ago
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆55Updated 7 months ago
- Real time faster whisper gradio☆26Updated 5 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆208Updated 2 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 2 months ago
- A quantization algorithm for LLM☆134Updated 8 months ago