大模型推理压测
☆47Jul 31, 2025Updated 8 months ago
Alternatives and similar repositories for llm_benchmark
Users that are interested in llm_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 视频理解:千问视频多模态模型 & Dify☆68Sep 2, 2024Updated last year
- 大模型智能体Agent中文教程,博客代码仓库☆61Nov 5, 2025Updated 5 months ago
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- ☆10Dec 18, 2021Updated 4 years ago
- 电商广告推荐系统☆14Jun 3, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 人脸贴纸☆37Aug 23, 2020Updated 5 years ago
- Optimize QWen1.5 models with TensorRT-LLM☆17May 14, 2024Updated last year
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated last year
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- Image Visualization Tools for C++☆14Oct 6, 2021Updated 4 years ago
- ☆12May 20, 2020Updated 5 years ago
- 基于iris数据集进行四种机器学习算法(决策树、朴素贝叶斯、随机森林、支持向量机SVM)的训练,使用交叉检验(Cross-validation)对比了各算法的预测准确率。☆23Apr 13, 2020Updated 6 years ago
- ☆28Nov 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- ☆13Apr 4, 2023Updated 3 years ago
- 阿里天池AI安全挑战第一期人脸识别攻击☆10Jun 26, 2020Updated 5 years ago
- ncnn qt yolov6☆13Aug 4, 2022Updated 3 years ago
- ☆10May 16, 2023Updated 2 years ago
- 本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性…☆43Mar 10, 2025Updated last year
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆235Dec 10, 2025Updated 4 months ago
- TensorFlow implementation of GhostNet: More Features from Cheap Operations.☆10Feb 6, 2020Updated 6 years ago
- Implementation of RetinaNet (focal loss) by TensorFlow (object detection)☆16Nov 29, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- training for VOC dataset☆11Nov 7, 2019Updated 6 years ago
- ☆90Jun 30, 2023Updated 2 years ago
- An Unoffical Implementation of PeleeNet by TensorFlow, Keras☆14Jun 17, 2019Updated 6 years ago
- Demonstration of the use of TensorRT and TRITON☆16Feb 9, 2021Updated 5 years ago
- the code of paper "Boundary-Sampled Halfspaces: A New Representation for Constructive Solid Modeling" (SIGGRAPH 2021)☆20Jan 18, 2024Updated 2 years ago
- ☆15Oct 9, 2018Updated 7 years ago
- The MXNet Implementation of ShuffleNet v1, v2 and MobileFaceNet☆10Feb 28, 2019Updated 7 years ago
- Co-DETR (Detection Transformer) compiled from PyTorch to NVIDIA TensorRT☆20Apr 19, 2025Updated 11 months ago
- Lightweight mobile humanpose estimation project☆14Aug 18, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于BiLSTM-CRF网络的中文电子病历命名实体识别☆35Feb 18, 2019Updated 7 years ago
- ☆15Aug 30, 2022Updated 3 years ago
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- 基于UNet的肝脏CT分割☆17Nov 19, 2020Updated 5 years ago
- 用于在昇腾设备上高性能推理PaddleOCR模型☆39Aug 1, 2025Updated 8 months ago
- ☆13May 16, 2025Updated 10 months ago
- ☆18Nov 28, 2022Updated 3 years ago