xiangyangkan / faster-bert-as-serviceLinks
Using TensorRT and Triton Server to build BERT model as a service
☆13Updated 3 years ago
Alternatives and similar repositories for faster-bert-as-service
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below
Sorting:
- TensorRT☆11Updated 5 years ago
- Minimal example of using a traced huggingface transformers model with libtorch☆36Updated 5 years ago
- ☆52Updated 4 years ago
- rasa_chinese 的服务 package☆18Updated 4 years ago
- BERT Tokenizer in C++☆78Updated 4 years ago
- implement bert in pure c++☆36Updated 5 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Updated 3 years ago
- KuaiSearch PERKS☆12Updated 4 years ago
- 基于BERT的预训练语言模型实现,分为两步:预训练和微调。目前已包括BERT、Roberta、ALbert三个模型,且皆可支持Whole Word Mask模式。☆17Updated 5 years ago
- Using FasterTransformer for accelerating the predict speed of bert and roberta☆14Updated 6 years ago
- Seq2seqAttGeneration, an basic implementation of text generation that using seq2seq attention model to generate poem series. this project…☆18Updated 4 years ago
- GLM (General Language Model)☆24Updated 3 years ago
- Bert TensorRT模型加速部署☆10Updated 3 years ago
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆14Updated 4 years ago
- pytorch during training, libtorch during serving via gRPC☆21Updated 6 years ago
- Convert BART models to ONNX with quantization. 3X reduction in size, and upto 3X boost in inference speed☆33Updated 11 months ago
- Finetune CPM-1☆24Updated 4 years ago
- This repo contains a PyTorch implementation of a pretrained ERNIE model for text classification.☆59Updated 2 years ago
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆13Updated 2 years ago
- An NVIDIA Triton Server workflow for OCR and the LayoutLMv3 Transformer Model☆29Updated 3 years ago
- ☆15Updated 5 years ago
- We start a company-name recognition task with a small scale and low quality training data, then using skills to enhanced model training s…☆81Updated 5 years ago
- 全球人工智能技术创新大赛-赛道三:小布助手对话短文本语义匹配☆12Updated 4 years ago
- Python下shuffle几百G文件☆33Updated 4 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- Facebook faiss相关的python接口☆15Updated 5 years ago
- Knowledge Graph based Question Answering benchmark.☆10Updated 5 years ago
- CLUE Emotion Analysis Dataset 细粒度情感分析数据集☆10Updated 5 years ago
- bert-of-theseus via bert4keras☆31Updated 5 years ago
- 基于电商导购机器人,自然语言理解(NLU),文本纠错,歧义词消歧☆12Updated 5 years ago