DataXujing / Bert_TensorRT
Accelerated deployment of BERT models with TensorRT
☆9Updated 2 years ago
Alternatives and similar repositories for Bert_TensorRT:
Users who are interested in Bert_TensorRT are comparing it to the repositories listed below
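- A roundup of news on data competitions in China and abroad☆18Updated 3 years ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Contest: third-place solution in the preliminary round☆49Updated last year
- cpp rotation album: a 3D rotating photo album implemented in C++ with Eigen, reproducing GAMES101 material☆12Updated 2 years ago
- A concise TensorRT tutorial☆26Updated 3 years ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM☆41Updated last year
- Table structure recognition algorithm for document images, from the Tonghuashun Algorithm Challenge, spring round, February to April 2022☆25Updated 3 years ago
- Hands-on tutorial for training Meituan's YOLOv6 model and deploying it end to end with TensorRT☆30Updated 2 years ago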
- Deploy ONNX models with TensorRT and LibTorch☆17Updated 3 years ago
- Barcode detection with DBNet, with both C++ and Python versions of the program☆35Updated 3 years ago
- YOLOv5 in PyTorch > ONNX > CoreML > iOS☆9Updated 7 months ago
- Deep learning models and datasets for the medical industry, open-sourced progressively☆13Updated 3 years ago
- Using TensorRT and Triton Server to build BERT model as a service☆13Updated 3 years ago
- ☆21Updated 7 months ago
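- Deploy a single model, multiple models, or multiple versions of the same model with tensorflow/serving, run model predictions, and monitor the service with Prometheus.☆11Updated 4 years ago
- miemienet is a C++ AI deep learning inference framework. Supports PPYOLOE and PICODET.☆11Updated 2 years ago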
- Wanwu models release, code will be released soon☆24Updated 2 years ago
- Whisper in TensorRT-LLM☆15Updated last year
- ☆17Updated 3 years ago
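- A deep learning framework implemented in pure Python that helps you understand the low-level details and land a job offer☆20Updated 2 years ago
- Deploy PP-YOLOE object detection with ONNXRuntime; supports four variants (PP-YOLOE-s, PP-YOLOE-m, PP-YOLOE-l, PP-YOLOE-x) and includes both C++ and Python versions of the program☆18Updated 2 years ago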
- A Special YOLOv5s-6.0 Deploy TensorRT☆8Updated 2 years ago
- A curated list of transformer learning materials, shared blogs, technical reviews.☆28Updated 4 years ago
- Pytorch2Caffe & Caffe2Pytorch☆8Updated 6 years ago
- Convert a Yolov3 model into a TensorRT model that supports dynamic-batch TensorRT inference and deployment on Triton Inference Serving☆28Updated 4 years ago
- ☆13Updated last year
- Lightweight deep learning inference service framework☆39Updated 3 years ago
- For 2022 Nvidia Hackathon☆20Updated 2 years ago
- Deploy DBNet text detection with opencv, with both C++ and Python implementations☆33Updated 3 years ago
- Hands-on large model deployment: TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year