大模型推理压测
☆47Jul 31, 2025Updated 9 months ago
Alternatives and similar repositories for llm_benchmark
Users that are interested in llm_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 这个库用于从零开始,搭建一个基于开源大模型的对话系统。包括基本的对话、与文档对话、智能体等多种功能☆10Sep 21, 2024Updated last year
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆45Aug 7, 2025Updated 8 months ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆24Feb 4, 2025Updated last year
- CanvasAnvil is an AI multi-canvas creation platform for flowcharts, interior design, presentations, posters, infographics, and product st…☆76Apr 25, 2026Updated last week
- 视频理解:千问视频多模态模型 & Dify☆68Sep 2, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python SDK for AgentRun: Build and deploy AI Agents with Serverless runtime, sandbox execution, and enterprise-grade observability☆19Updated this week
- 大模型智能体Agent中文教程,博客代码仓库☆62Nov 5, 2025Updated 5 months ago
- AI 应用开发工程师面试宝典 - 二狗子整理☆62Apr 24, 2026Updated last week
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- ☆10Dec 18, 2021Updated 4 years ago
- 电商广告推荐系统☆14Jun 3, 2022Updated 3 years ago
- 力扣题单hot100的ACM模式实现☆39Sep 2, 2025Updated 8 months ago
- CCKS举办的针对电子病例的信息抽取比赛,主要是进行医疗实体及事件抽取,本项目包括展示比赛的不断改进与多种方法的尝试,最终取得:valid第6名;test第9名。☆15Oct 10, 2021Updated 4 years ago
- Slowist's notebook☆17Mar 18, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- ☆10Jul 23, 2023Updated 2 years ago
- CopilotKit AI助手演示应用 - 展示前端UI与后端Agent交互☆38Jul 17, 2025Updated 9 months ago
- ☆17Jul 1, 2022Updated 3 years ago
- 自己阅读的多模态对话系统论文(及部分笔记)汇总☆22Jan 5, 2023Updated 3 years ago
- Optimize QWen1.5 models with TensorRT-LLM☆17May 14, 2024Updated last year
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- Global Orthographic Object Descriptor (GOOD)☆13Apr 11, 2022Updated 4 years ago
- 浙江大学选课脚本☆14Aug 13, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型 的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- 虚拟 主播助理 进入房间播报 触发礼物事件 及大模型回复弹幕消息☆26Feb 11, 2025Updated last year
- Documentation for users of Jenkins project infrastructure☆25Updated this week
- ☆12May 20, 2020Updated 5 years ago
- A Philosophy of Software Design 《软件设计的哲学》中文翻译☆19Feb 25, 2024Updated 2 years ago
- ☆10Jan 13, 2020Updated 6 years ago
- ☆28Nov 6, 2024Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [RAG训练营] u.geekbang.org/subject/airag/1009927 ESG合规审计系统 - 可持续发展报告检查工具☆34Jun 1, 2025Updated 11 months ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- ☆13Apr 4, 2023Updated 3 years ago
- ncnn qt yolov6☆13Aug 4, 2022Updated 3 years ago
- 本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性…☆45Mar 10, 2025Updated last year
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago