DataXujing / Bert_TensorRTLinks
Bert TensorRT模型加速部署
☆9Updated 3 years ago
Alternatives and similar repositories for Bert_TensorRT
Users that are interested in Bert_TensorRT are comparing it to the libraries listed below
Sorting:
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆49Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆18Updated 10 months ago
- 陆续开源医疗行业的深度学习模型及数据集☆13Updated 3 years ago
- 纯Python实现的深度学习框架,帮助你理解底层细节斩获offer☆20Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- TensorRT简明教程☆26Updated 3 years ago
- 使用DBNet检测条形码,包含C++和Python两种版本的程序☆37Updated 4 years ago
- 手摸手 美团 YOLOv6模型训练和TensorRT端到端部署方案教程☆32Updated 3 years ago
- YOLOv5 in PyTorch > ONNX > CoreML > iOS☆9Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆50Updated last year
- cpp rotation album,基于cpp eigen实现的3d旋转相册,GAMES101复现内容☆12Updated 3 years ago
- 基于rknn的yolov5的cpp实现,包含各种依赖库,是一个完整工程,可直接编译运行☆20Updated 3 years ago
- miemienet is a C++ AI deep learning inference framework.Supports PPYOLOE、PICODET.☆11Updated 2 years ago
- Wanwu models release, code will be released soon☆24Updated 2 years ago
- paper-read-notes☆12Updated 10 months ago
- Whisper in TensorRT-LLM☆16Updated last year
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆34Updated 2 years ago
- deploy onnx models with TensorRT and LibTorch☆17Updated 3 years ago
- HunyuanDiT with TensorRT and libtorch☆17Updated last year
- ☆27Updated last month
- For 2022 Nvidia Hackathon☆22Updated 3 years ago
- 使用opencv部署DBNet文字检测,包含C++和Python两种版本的实现☆33Updated 4 years ago
- 文档图片表格结构识别算法-同花顺算法挑战赛-2022年2-4月春季赛☆25Updated 3 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆41Updated 2 years ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆32Updated last year
- ☆14Updated 5 years ago
- Tensorflow Basic Sample Code☆19Updated 2 years ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- 将Yolov3模型转成可以进行动态Batch的TensorRT推理以及Triton Inference Serving上部署的TensorRT模型☆28Updated 4 years ago