大模型推理压测
☆48Jul 31, 2025Updated 10 months ago
Alternatives and similar repositories for llm_benchmark
Users that are interested in llm_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆47Aug 7, 2025Updated 10 months ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆24Feb 4, 2025Updated last year
- CanvasAnvil is an AI multi-canvas creation platform for flowcharts, interior design, presentations, posters, infographics, and product st…☆84May 31, 2026Updated 2 weeks ago
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆23Mar 18, 2026Updated 2 months ago
- 视频理解:千问视频多模态模型 & Dify☆70Sep 2, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 根据XYBot V2 进行的二次封装,加入了web,和插件市场☆15Apr 2, 2025Updated last year
- ☆31Jul 22, 2021Updated 4 years ago
- Python SDK for AgentRun: Build and deploy AI Agents with Serverless runtime, sandbox execution, and enterprise-grade observability☆26Jun 6, 2026Updated last week
- 大模型智能体Agent中文教程,博客代码仓库☆64Nov 5, 2025Updated 7 months ago
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- 电商广告推荐系统☆14Jun 3, 2022Updated 4 years ago
- ☆12Apr 2, 2026Updated 2 months ago
- Open Hackathon Playbook☆27May 9, 2026Updated last month
- 力扣题单hot100的ACM模式实现☆44Sep 2, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- CopilotKit AI助手演示应用 - 展示前端UI与后端Agent交互☆39Jul 17, 2025Updated 10 months ago
- 华东师范大学课程表导出工具☆19Jan 1, 2022Updated 4 years ago
- 自己阅读的多模态对话系统论文(及部分笔记)汇总☆22Jan 5, 2023Updated 3 years ago
- Optimize QWen1.5 models with TensorRT-LLM☆17May 14, 2024Updated 2 years ago
- train cifar10 example with mixup method☆10Dec 30, 2017Updated 8 years ago
- 大模型推理框架加速,让 LLM 飞起来☆24May 10, 2024Updated 2 years ago
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported☆13Jul 27, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- ☆12May 20, 2020Updated 6 years ago
- https://mp.weixin.qq.com/s/7t0e_hfyDh1b2GPVlzXIMg 或 https://yq.aliyun.com/articles/636272☆11Aug 31, 2018Updated 7 years ago
- ☆27Nov 6, 2024Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- CGED & CSC☆23Feb 27, 2020Updated 6 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Apr 4, 2023Updated 3 years ago
- 阿里天池AI安全挑战第一期人脸识别攻击☆10Jun 26, 2020Updated 5 years ago
- ☆10May 16, 2023Updated 3 years ago
- ncnn qt yolov6☆13Aug 4, 2022Updated 3 years ago
- Bash is All You Need. A pure Bash reimplementation of OpenClaw. No dependencies. No runtime. Runs everywhere since 2006☆100Feb 19, 2026Updated 3 months ago
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago
- ☆12Jul 14, 2021Updated 4 years ago