xiangyangkan / faster-bert-as-service
Using TensorRT and Triton Server to build BERT model as a service
☆13Updated 3 years ago
Alternatives and similar repositories for faster-bert-as-service
Users who are interested in faster-bert-as-service are comparing it to the libraries listed below.
- Minimal example of using a traced huggingface transformers model with libtorch☆36Updated 5 years ago
- TensorRT☆11Updated 5 years ago
- Service package for rasa_chinese☆18Updated 4 years ago
- Implements BERT in pure C++☆36Updated 5 years ago
- ☆52Updated 4 years ago
- BERT TensorRT model acceleration and deployment☆10Updated 3 years ago
- KuaiSearch PERKS☆12Updated 3 years ago
- Lightweight deep learning inference service framework☆40Updated 4 years ago
- BERT Tokenizer in C++☆78Updated 4 years ago
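- AI Challenger 2018 reading comprehension track: shared code☆21Updated 6 years ago
- A complete closed-loop pipeline for NER, from offline training to online deployment☆14Updated 4 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Updated 4 years ago
- Implementation of BERT-based pretrained language models in two steps: pretraining and fine-tuning. Currently includes three models (BERT, RoBERTa, and ALBERT), all of which support Whole Word Mask mode.☆17Updated 5 years ago
- Python interfaces for Facebook faiss☆15Updated 5 years ago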
- GLM (General Language Model)☆24Updated 3 years ago
- Finetune CPM-1☆24Updated 4 years ago
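- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆49Updated 2 years ago
- ☢️ TensorRT 2023 second round: Llama model inference acceleration and optimization based on TensorRT-LLM☆50Updated last year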
- Set up CI in DL/ cuda/ cudnn/ TensorRT/ onnx2trt/ onnxruntime/ onnxsim/ Pytorch/ Triton-Inference-Server/ Bazel/ Tesseract/ PaddleOCR/ NV…☆43Updated 2 years ago
- 2019 Daguan Cup entity recognition☆19Updated 6 years ago
- Distributed service deployment of ML models: gRPC, Flask; Docker☆76Updated 4 years ago
- NMT model with BERT in tensorflow 2.0☆20Updated 6 years ago
- huggingface ChineseBert Tokenizer☆16Updated 3 years ago
- ☆15Updated last year
- MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices☆69Updated 5 years ago
- This repo contains a PyTorch implementation of a pretrained ERNIE model for text classification.☆59Updated 2 years ago
- pytorch during training, libtorch during serving via gRPC☆21Updated 6 years ago
- ☆36Updated last year
- Using FasterTransformer to accelerate the prediction speed of BERT and RoBERTa☆14Updated 6 years ago
- Chinese-English translation in tensor2tensor using your own corpus☆23Updated 6 years ago