FlyAIBox/llm_benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FlyAIBox/llm_benchmark)

FlyAIBox / llm_benchmark

大模型推理压测

☆50

Alternatives and similar repositories for llm_benchmark

Users that are interested in llm_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CreaLabs / Enhanced-BGE-M3-with-CLP-and-MoE
View on GitHub
This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…
☆11Dec 27, 2024Updated last year
Miraclemarvel55 / LLaMA-MOSS-RLHF-LoRA
View on GitHub
用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]
☆21May 16, 2023Updated 3 years ago
huangjia2019 / rag-project03-audit
View on GitHub
[RAG训练营] u.geekbang.org/subject/airag/1009927 ESG合规审计系统 - 可持续发展报告检查工具
☆39Jun 1, 2025Updated last year
nikoHu / ChatForResearch
View on GitHub
这个库用于从零开始，搭建一个基于开源大模型的对话系统。包括基本的对话、与文档对话、智能体等多种功能
☆10Sep 21, 2024Updated last year
DataXujing / DeepSeek-R1-Android
View on GitHub
安卓手机部署DeepSeek-R1 蒸馏的1.5B模型
☆24Feb 4, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
LLMServe / FastServe
View on GitHub
☆29Sep 26, 2025Updated 10 months ago
ZBayes / poc_project
View on GitHub
通用简单工具项目
☆22Oct 6, 2024Updated last year
pingcy / a2a-demo
View on GitHub
A2A协议智能体通信演示项目 - 支持流式响应、推送通知和文件附件的智能体客户端
☆21Jun 9, 2025Updated last year
shuttie / embed-benchmark
View on GitHub
☆16Nov 10, 2023Updated 2 years ago
FlyAIBox / dcu-in-action
View on GitHub
国产加速卡-海光DCU实战（大模型训练、微调、推理等）
☆94Aug 10, 2025Updated 11 months ago
lework / llm-benchmark
View on GitHub
LLM 并发性能测试工具，支持自动化压力测试和性能报告生成。
☆268Dec 10, 2025Updated 7 months ago
headacheboy / IGSQL
View on GitHub
☆28Nov 15, 2020Updated 5 years ago
exoskeletonzj / MARS
View on GitHub
A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization
☆18Dec 15, 2025Updated 7 months ago
yangminghuan / e-commerce-ad-rec-sys
View on GitHub
电商广告推荐系统
☆14Jun 3, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
sunshine-JLU / deepseek-r1-distill-llama-8b-lora
View on GitHub
The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.
☆17Feb 19, 2025Updated last year
azukiapp / docker-php-fpm
View on GitHub
PHP with FPM Dockerfile for trusted automated Docker builds.
☆12Mar 2, 2016Updated 10 years ago
mathinml / knowbase
View on GitHub
☆16Apr 30, 2025Updated last year
obaby / baby_wx_post_spider
View on GitHub
微信公众号文章爬虫，基于selenium登录微信公众平台后进行爬取。
☆12Jun 16, 2020Updated 6 years ago
Franc-Z / QWen1.5_TensorRT-LLM
View on GitHub
Optimize QWen1.5 models with TensorRT-LLM
☆17May 14, 2024Updated 2 years ago
wenzhaoabc / mmkg-rag
View on GitHub
Enhancing Retrieval-Augmented Generation with Multi-Modal Knowledge Graph Integration
☆15Feb 28, 2026Updated 4 months ago
mu-zi-lee / Page-agent-UI
View on GitHub
基于Page agent的个人UI/UX 优化 Chrome 插件
☆17Mar 13, 2026Updated 4 months ago
jtchen2k / ECNU-class2ics
View on GitHub
华东师范大学课程表导出工具
☆19Jan 1, 2022Updated 4 years ago
Feywell / mixup_mxnet
View on GitHub
train cifar10 example with mixup method
☆10Dec 30, 2017Updated 8 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
liuhuanyong / OntologyDrivenSmartCampusAdvisor
View on GitHub
本体驱动的简单demo，基于知识图谱的智能校园课程与职业规划顾问。通过本体建模 + 规则推理 + 自然语言问答，为学生提供选课建议、职业规划、技能缺口分析等服务，并完整可视化每一步推理路径。
☆20Jul 17, 2026Updated last week
pydaxing / clip_blip_embedding_rag
View on GitHub
在RAG技术中，嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务，该服务支持文本和图像的嵌入生成与相似度计算，为多模态信息检索提供了基础能力。
☆42Dec 28, 2024Updated last year
zRzRzRzRzRzRzR / lm-fly
View on GitHub
大模型推理框架加速，让 LLM 飞起来
☆24May 10, 2024Updated 2 years ago
IIMarch / pseudo-3d-residual-networks-mxnet
View on GitHub
mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported
☆13Jul 27, 2018Updated 8 years ago
QunBB / bert-pretraining
View on GitHub
BERT&RoBERTa预训练代码，tensorflow和torch两种版本实现
☆13Feb 8, 2023Updated 3 years ago
azharlabs / large-models
View on GitHub
☆15Feb 7, 2024Updated 2 years ago
aotuai / brainframe-python
View on GitHub
🧠🖼️🐍 A Python wrapper around the BrainFrame REST API
☆12Jun 28, 2026Updated 3 weeks ago
ontio / ontology-ddxf
View on GitHub
Distributed data eXchange Framework,which allows to build data marketplaces .
☆20May 6, 2019Updated 7 years ago
jjovalle99 / fastapi-langgraph-template
View on GitHub
Accelerate AI development
☆17Jul 3, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aliyun / kvc-3fs-operator
View on GitHub
☆42Apr 16, 2026Updated 3 months ago
wkentaro / imgviz-cpp
View on GitHub
Image Visualization Tools for C++
☆14Oct 6, 2021Updated 4 years ago
BUAADreamer / Qwen2-VL-History
View on GitHub
Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums
☆15Sep 17, 2024Updated last year
hopef / llama3_chat
View on GitHub
Llama3 Streaming Chat Sample
☆22Apr 24, 2024Updated 2 years ago
rbatis / fast_pool
View on GitHub
a fast async pool based on channel
☆26Apr 22, 2026Updated 3 months ago
SCNU203 / Math23k
View on GitHub
The Math23k dataset for downloading
☆22Apr 16, 2022Updated 4 years ago
guanguanboy / FaceQualityEvaluation
View on GitHub
☆10Jan 13, 2020Updated 6 years ago