Using TensorRT and Triton Server to build BERT model as a service
☆13Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for faster-bert-as-service
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Aug 18, 2021Updated 4 years ago
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆14Jun 16, 2021Updated 4 years ago
- Tutorial from @rwieruch's Road to Learn React☆14Sep 2, 2018Updated 7 years ago
- Convert BART models to ONNX with quantization. 3X reduction in size, and upto 3X boost in inference speed☆33Dec 11, 2024Updated last year
- 将Yolov3模型转成可以进行动态Batch的TensorRT推理以及Triton Inference Serving上部署的TensorRT模型☆29Jan 7, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- communication sur le moteur de pseudonymisation de la Cour de Cassation☆18Feb 14, 2023Updated 3 years ago
- CHIP2021医学对话临床发现阴阳性判别任务冠军方案☆17Mar 11, 2022Updated 4 years ago
- Algorithms from paper: Evaluation of Session-based Recommendation Algorithms☆16Nov 8, 2018Updated 7 years ago
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- triton server ensemble model demo☆30May 2, 2022Updated 4 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- This demo showcase the use of onnxruntime-rs with a GPU on CUDA 11 to run Bert in a data pipeline with Rust.☆16Feb 7, 2022Updated 4 years ago
- Provide customized diffusers training and inference code for different needs☆12Jan 16, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- GBDT结合LR的二分类模型,封装成了一个类。scikit-learn风格,可以fit和predict。有run_demo☆11Sep 5, 2019Updated 6 years ago
- ☆15Aug 3, 2025Updated 9 months ago
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset☆18Jun 1, 2020Updated 5 years ago
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆14Oct 4, 2022Updated 3 years ago
- 转换 https://github.com/brightmart/albert_zh 到google格式☆61Sep 28, 2020Updated 5 years ago
- 文本生成 - 通过商品参数和图片自动生成营销文本☆12Sep 17, 2021Updated 4 years ago
- 医院体检报告信息抽取及模板生成☆12Apr 25, 2019Updated 7 years ago
- 科大讯飞线下销量挑战赛top7方案☆13Aug 21, 2021Updated 4 years ago
- 电商评论观点挖掘☆45Jan 29, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 2019CCF爱奇艺视频拷贝(版权)检测算法☆15Dec 11, 2019Updated 6 years ago
- 基于BERT+Biaffine结构的关系抽取模型☆12Feb 23, 2022Updated 4 years ago
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago
- 企业事件抽取☆13May 20, 2021Updated 5 years ago
- ☆18Feb 22, 2022Updated 4 years ago
- 根据维基百科历史编辑数据提取纠错语料。☆12Apr 6, 2022Updated 4 years ago
- 2021 QQ浏览器ai算法大赛 赛道一 决赛第17名☆17Oct 25, 2022Updated 3 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- 基于苏剑林项目的复用,应用于金融事件关系抽取☆11Mar 26, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 2019搜狐第三届内容识别挑战赛rank10☆11Oct 17, 2019Updated 6 years ago
- Code for the AAAI-2021 paper: C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling☆16Mar 9, 2021Updated 5 years ago
- Visualize feature maps in convolutional neural networks.☆12Apr 3, 2018Updated 8 years ago
- 中文事件抽取☆11Feb 26, 2021Updated 5 years ago
- 训练拼音转汉字模型☆10Jan 15, 2021Updated 5 years ago
- tf&torch about nlp☆11Aug 12, 2022Updated 3 years ago
- 本项目使用Keras实现R-BERT,在人物关系数据集上进行测试验证。☆10Apr 17, 2021Updated 5 years ago