xiangyangkan / faster-bert-as-service
Using TensorRT and Triton Server to build BERT model as a service
☆12Updated 3 years ago
Alternatives and similar repositories for faster-bert-as-service:
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below
- TensorRT☆11Updated 4 years ago
- GLM (General Language Model)☆24Updated 2 years ago
- Minimal example of using a traced huggingface transformers model with libtorch☆35Updated 4 years ago
- KuaiSearch PERKS☆11Updated 3 years ago
- Large-scale exact string matching tool☆15Updated 2 months ago
- lightweighted deep learning inference service framework☆40Updated 3 years ago
- Bert TensorRT模型加速部署☆9Updated 2 years ago
- Finetune CPM-1☆24Updated 3 years ago
- ☆52Updated 3 years ago
- BERT Tokenizer in C++☆75Updated 4 years ago
- rasa_chinese 的服务 package☆18Updated 3 years ago
- 基于电商导购机器人,自然语言理解(NLU),文本纠错,歧义词消歧☆12Updated 4 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆13Updated 2 years ago
- implement bert in pure c++☆36Updated 4 years ago
- machine reading comprehension with deep learning☆20Updated 6 years ago
- aigc evals☆10Updated last year
- ChineseWord correct!!when you input some error words,return some maybe right word☆8Updated 10 years ago
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆13Updated 3 years ago
- ☆14Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆45Updated last year
- ☆25Updated 3 months ago
- CLUE Emotion Analysis Dataset 细粒度情感分析数据集☆8Updated 5 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 4 years ago
- NMT model with BERT in tensorflow 2.0☆20Updated 5 years ago
- Seq2seqAttGeneration, an basic implementation of text generation that using seq2seq attention model to generate poem series. this project…☆18Updated 4 years ago
- source code of EMNLP2021: A Lightweight Pretrained Model for Chinese Spelling Check☆14Updated 3 years ago
- Bi-Directional Attention Flow for Machine Comprehensions☆9Updated 7 years ago
- ☆17Updated 3 years ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆32Updated 3 years ago