Using TensorRT and Triton Server to build BERT model as a service
☆13Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for faster-bert-as-service
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Tutorial from @rwieruch's Road to Learn React☆14Sep 2, 2018Updated 7 years ago
- 将Yolov3模型转成可以进行动态Batch的TensorRT推理以及Triton Inference Serving上部署的TensorRT模型☆29Jan 7, 2021Updated 5 years ago
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- CHIP2021医学对话临床发现阴阳性判别任务冠军方案☆17Mar 11, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Algorithms from paper: Evaluation of Session-based Recommendation Algorithms☆16Nov 8, 2018Updated 7 years ago
- Non Metric Space ( Approximate ) Library in R