xiangyangkan / faster-bert-as-service
Using TensorRT and Triton Server to build BERT model as a service
☆13Updated 4 years ago
Alternatives and similar repositories for faster-bert-as-service
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below
- TensorRT☆11Updated 5 years ago
- Minimal example of using a traced huggingface transformers model with libtorch☆35Updated 5 years ago
- ☆52Updated 4 years ago
- Implement BERT in pure C++☆36Updated 5 years ago
- BERT Tokenizer in C++☆79Updated 4 years ago
- Finetune CPM-1☆24Updated 4 years ago
- KuaiSearch PERKS☆12Updated 4 years ago
- BERT model acceleration and deployment with TensorRT☆10Updated 3 years ago
- Service package for rasa_chinese☆18Updated 4 years ago
- GLM (General Language Model)☆24Updated 3 years ago
- An NVIDIA Triton Server workflow for OCR and the LayoutLMv3 Transformer Model☆30Updated 3 years ago
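- A BERT-based implementation of pre-trained language models in two steps: pre-training and fine-tuning. Currently includes three models (BERT, RoBERTa, and ALBERT), all of which support Whole Word Mask mode.☆17Updated 5 years ago
- Lightweight deep learning inference service framework☆39Updated 4 years ago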
- Source code and checkpoints for legal pre-trained language models.☆15Updated 4 years ago
- huggingface ChineseBert Tokenizer☆16Updated 3 years ago
- A highly efficient string matching tool that supports forward/backward maximum matching word segmentation and exact multi-pattern string matching☆16Updated 2 years ago
- Convert BART models to ONNX with quantization. 3X reduction in size, and up to 3X boost in inference speed☆33Updated last year
- ☢️ TensorRT 2023 second round: Llama model inference acceleration and optimization based on TensorRT-LLM☆51Updated 2 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Updated 5 years ago
- Linear chain conditional random fields are implemented using NumPy and MXNet/Gluon, and batch training is supported, not limited to train…☆22Updated 6 years ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆49Updated 2 years ago
- CLUE Emotion Analysis Dataset, a fine-grained sentiment analysis dataset☆10Updated 5 years ago
- Offline reading comprehension app: QA for mobile, Android & iPhone☆60Updated 3 years ago
- Shuffle files of several hundred GB in Python☆33Updated 4 years ago
- PyTorch during training, libtorch during serving via gRPC☆21Updated 6 years ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector.☆38Updated 2 years ago
- Python interfaces for Facebook faiss☆15Updated 5 years ago
- Chinese version of reformer-pytorch, a simple and efficient generative model with results similar to GPT2☆16Updated 2 years ago
- Code for participation in the ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Updated 4 years ago
- A roundup of data competition news from China and abroad☆18Updated 4 years ago