Using TensorRT and Triton Server to build a BERT model as a service
☆13Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for faster-bert-as-service
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below.
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Aug 18, 2021Updated 4 years ago
- Convert BART models to ONNX with quantization. 3X reduction in size, and up to 3X boost in inference speed☆33Dec 11, 2024Updated last year
- Convert a Yolov3 model into a TensorRT model that supports dynamic-batch TensorRT inference and deployment on Triton Inference Serving☆29Jan 7, 2021Updated 5 years ago
- Magface Triton Inference Server Using TensorRT☆18Feb 12, 2022Updated 4 years ago
- Communication about the pseudonymization engine of the Cour de Cassation☆18Feb 14, 2023Updated 3 years ago
- Winning solution for the CHIP2021 task on positive/negative classification of clinical findings in medical dialogues☆16Mar 11, 2022Updated 4 years ago
- Algorithms from paper: Evaluation of Session-based Recommendation Algorithms☆16Nov 8, 2018Updated 7 years ago
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- Spot Micro Quadruped Project☆10Dec 1, 2021Updated 4 years ago
- triton server ensemble model demo☆30May 2, 2022Updated 3 years ago
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 3 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- A set of demos for deploying a Machine Learning Model in production using various methods☆61Sep 22, 2021Updated 4 years ago
- Provide customized diffusers training and inference code for different needs☆12Jan 16, 2024Updated 2 years ago
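- Binary classification model combining GBDT with LR, wrapped in a single class. scikit-learn style, with fit and predict. Includes run_demo☆11Sep 5, 2019Updated 6 years ago
- Text error-correction model with pinyin and glyph features☆11Jan 1, 2023Updated 3 years ago
- Chinese ltp lexical analysis based on the onnxruntime inference engine☆14Oct 4, 2022Updated 3 years ago
- Convert https://github.com/brightmart/albert_zh to the Google format☆61Sep 28, 2020Updated 5 years ago
- PyTorch version of Chinese multi-turn Cdial based on gpt+nezha☆11Oct 22, 2022Updated 3 years ago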
- ☆10Dec 8, 2022Updated 3 years ago
- ☆20Jul 20, 2022Updated 3 years ago
- Top-7 solution for the iFLYTEK offline sales challenge☆12Aug 21, 2021Updated 4 years ago
- Opinion mining for e-commerce reviews☆44Jan 29, 2021Updated 5 years ago
- 2019 CCF iQIYI video copy (copyright) detection algorithm☆15Dec 11, 2019Updated 6 years ago
- English or Chinese GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago
- ☆10Mar 23, 2020Updated 6 years ago
- ☆11Mar 1, 2021Updated 5 years ago
- ☆19Feb 22, 2022Updated 4 years ago
- Set up CI in DL/ cuda/ cudnn/ TensorRT/ onnx2trt/ onnxruntime/ onnxsim/ Pytorch/ Triton-Inference-Server/ Bazel/ Tesseract/ PaddleOCR/ NV…☆43Sep 27, 2023Updated 2 years ago
- Extract an error-correction corpus from Wikipedia edit-history data.☆12Apr 6, 2022Updated 3 years ago
- This project is to test the adversarial training method with TextCNN text classification model.☆12Jul 8, 2021Updated 4 years ago
- Reuse of Su Jianlin's project, applied to financial event relation extraction☆11Mar 26, 2021Updated 5 years ago
- Code for the AAAI-2021 paper: C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling☆16Mar 9, 2021Updated 5 years ago
- One Repository of AI Series: Collecting useful technology articles, open-source tutorials and open-source books. One of my AI series repositories: collecting useful technical articles, open…☆10Nov 14, 2024Updated last year
- Visualize feature maps in convolutional neural networks.☆12Apr 3, 2018Updated 7 years ago
- Chinese event extraction☆11Feb 26, 2021Updated 5 years ago
- Train a pinyin-to-Chinese-character model☆10Jan 15, 2021Updated 5 years ago
- Home surveillance system with facial recognition☆17Jun 10, 2020Updated 5 years ago