zRzRzRzRzRzRzR / lm-fly
大模型推理框架加速,让 LLM 飞起来
☆19Updated 11 months ago
Alternatives and similar repositories for lm-fly:
Users that are interested in lm-fly are comparing it to the libraries listed below
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆50Updated this week
- 使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略☆29Updated 2 weeks ago
- Examples for QinYan GLMs☆13Updated 7 months ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆119Updated 11 months ago
- AI Emoji Argue Agent 🚀 基于LangChain的开源表情包斗图Agent☆25Updated 10 months ago
- 我们是第一个完全可商用的角色大模型。☆39Updated 8 months ago
- GLM Series Edge Models☆136Updated 2 months ago
- ☆16Updated 9 months ago
- Qwen GRPO Graph Extraction RL Finetune☆46Updated 3 weeks ago
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images.一款基于AI的内容转换工…☆13Updated last week
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 10 months ago
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆11Updated last month
- ☆140Updated 11 months ago
- Imitate OpenAI with Local Models☆88Updated 7 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 4 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆242Updated last year
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 3 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆72Updated 9 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆243Updated last week
- Mixture-of-Experts (MoE) Language Model☆186Updated 7 months ago
- Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆19Updated 7 months ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- ☆119Updated last week
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆155Updated 5 months ago
- SUS-Chat: Instruction tuning done right☆48Updated last year
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆215Updated last week
- Evaluation for AI apps and agent☆40Updated last year
- LLM101n: Let's build a Storyteller 中文版☆131Updated 8 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆157Updated this week