Using TensorRT and Triton Server to build BERT model as a service
☆13Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for faster-bert-as-service
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Aug 18, 2021Updated 4 years ago
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆14Jun 16, 2021Updated 5 years ago
- Tutorial from @rwieruch's Road to Learn React☆14Sep 2, 2018Updated 7 years ago
- 将Yolov3模型转成可以进行动态Batch的TensorRT推理以及Triton Inference Serving上部署的TensorRT模型☆29Jan 7, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- CHIP2021医学对话临床发现阴阳性判别任务冠军方案☆17Mar 11, 2022Updated 4 years ago
- Algorithms from paper: Evaluation of Session-based Recommendation Algorithms☆16Nov 8, 2018Updated 7 years ago
- MXNet Model Serving☆25Oct 4, 2017Updated 8 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- A set of demo of deploying a Machine Learning Model in production using various methods☆61Sep 22, 2021Updated 4 years ago
- This demo showcase the use of onnxruntime-rs with a GPU on CUDA 11 to run Bert in a data pipeline with Rust.☆16Feb 7, 2022Updated 4 years ago
- Provide customized diffusers training and inference code for different needs☆12Jan 16, 2024Updated 2 years ago
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset☆18Jun 1, 2020Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆14Oct 4, 2022Updated 3 years ago
- 转换 https://github.com/brightmart/albert_zh 到google格式☆61Sep 28, 2020Updated 5 years ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago
- The simple demo of `Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval`☆14Dec 9, 2024Updated last year
- ☆21Jul 20, 2022Updated 3 years ago
- ☆10Dec 8, 2022Updated 3 years ago
- 文本生成 - 通过商品参数和图片自动生成营销文本☆12Sep 17, 2021Updated 4 years ago
- 医院体检报告信息抽取及模板 生成☆12Apr 25, 2019Updated 7 years ago
- 科大讯飞线下销量挑战赛top7方案☆13Aug 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 电商评论观点挖掘☆44Jan 29, 2021Updated 5 years ago
- 基于BERT+Biaffine结构的关系抽取模型☆12Feb 23, 2022Updated 4 years ago
- 2019CCF爱奇艺视频拷贝(版权)检测算法☆15Dec 11, 2019Updated 6 years ago
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago
- ☆10Mar 23, 2020Updated 6 years ago
- ☆11Mar 1, 2021Updated 5 years ago
- 企业事件抽取☆13May 20, 2021Updated 5 years ago
- ☆18Feb 22, 2022Updated 4 years ago
- 可以将bert相关模型在网站上进行展示、测试,后端使用flask,前端使用vue。☆11Aug 20, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 根据维基百科历史编辑数据提取纠错语料。☆12Apr 6, 2022Updated 4 years ago
- 2021 QQ浏览器ai算法大赛 赛道一 决赛第17名☆17Oct 25, 2022Updated 3 years ago
- This project is to test the adversarial training method with TextCNN text classification model.☆12Jul 8, 2021Updated 4 years ago
- 基于苏剑林项目的复用,应用于金融事件关系抽取☆11Mar 26, 2021Updated 5 years ago
- 2019搜狐第三届内容识别挑战赛rank10☆11Oct 17, 2019Updated 6 years ago
- One Repository of AI Series: Collecting the useful technology articles, opensource tutorials and opensource books. 我的AI系列仓库之一:收集有用的技术文章、开…☆10Nov 14, 2024Updated last year
- Code for the AAAI-2021 paper: C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling☆16Mar 9, 2021Updated 5 years ago