大模型推理压测
☆47Jul 31, 2025Updated 9 months ago
Alternatives and similar repositories for llm_benchmark
Users that are interested in llm_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 这个库用于从零开始,搭建一个基于开源大模型的对话系统。包括基本的对话、与文档对话、智能体等多种功能☆10Sep 21, 2024Updated last year
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆24Feb 4, 2025Updated last year
- Ollama RAG using SQL Database☆12Apr 16, 2025Updated last year
- 视频理解:千问视频多模态模型 & Dify☆69Sep 2, 2024Updated last year
- Python SDK for AgentRun: Build and deploy AI Agents with Serverless runtime, sandbox execution, and enterprise-grade observability☆23Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 大模型智能体Agent中文教程,博客代码仓库☆64Nov 5, 2025Updated 6 months ago
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- 力扣题单hot100的ACM模式实现☆41Sep 2, 2025Updated 8 months ago
- CCKS举办的针对电子病例的信息抽取比赛,主要是进行医疗实体及事件抽取,本项目包括展示比赛的不断改进与多种方法的尝试,最终取得:valid第6名;test第9名。☆15Oct 10, 2021Updated 4 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- 校园论坛项目——仿牛客论坛☆21Nov 12, 2025Updated 6 months ago
- CopilotKit AI助手演示应用 - 展示前端UI与后端Agent交互☆39Jul 17, 2025Updated 10 months ago
- 人脸贴纸☆37Aug 23, 2020Updated 5 years ago
- 自己阅读的多模态对话系统论文(及部分笔记)汇总☆22Jan 5, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Optimize QWen1.5 models with TensorRT-LLM☆17May 14, 2024Updated 2 years ago
- 大模型推理框架加速 ,让 LLM 飞起来☆24May 10, 2024Updated 2 years ago
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported☆13Jul 27, 2018Updated 7 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- Image Visualization Tools for C++☆14Oct 6, 2021Updated 4 years ago
- https://mp.weixin.qq.com/s/7t0e_hfyDh1b2GPVlzXIMg 或 https://yq.aliyun.com/articles/636272☆11Aug 31, 2018Updated 7 years ago
- ☆10Jan 13, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 基于iris数据集进行四种机器学习算法(决策树、朴素贝叶斯、随机森林、支持向量机SVM)的训练,使用交叉检验(Cross-validation)对比了各算法的预测准确率。☆23Apr 13, 2020Updated 6 years ago
- ☆27Nov 6, 2024Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- ☆13Apr 4, 2023Updated 3 years ago
- ncnn qt yolov6☆13Aug 4, 2022Updated 3 years ago
- Bash is All You Need. A pure Bash reimplementation of OpenClaw. No dependencies. No runtime. Runs everywhere since 2006☆97Feb 19, 2026Updated 3 months ago
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆12Jul 14, 2021Updated 4 years ago
- TensorFlow implementation of GhostNet: More Features from Cheap Operations.☆10Feb 6, 2020Updated 6 years ago
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆258Dec 10, 2025Updated 5 months ago
- A tiny KV storage based on skiplist written in Java language| 使用Java开发,基于跳表实现的轻量级键值数据库🔥🔥 🚀☆35Aug 22, 2024Updated last year
- 本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性…☆46Mar 10, 2025Updated last year
- Implementation of RetinaNet (focal loss) by TensorFlow (object detection)☆16Nov 29, 2019Updated 6 years ago
- Recommend system recall algorithms based on Pandas and Cython.☆12Apr 13, 2026Updated last month