xiangyangkan / faster-bert-as-service
Using TensorRT and Triton Server to build BERT model as a service
☆13Updated 3 years ago
Alternatives and similar repositories for faster-bert-as-service
Users that are interested in faster-bert-as-service are comparing it to the libraries listed below
- Minimal example of using a traced huggingface transformers model with libtorch☆36Updated 4 years ago
- TensorRT☆11Updated 4 years ago
- ☆52Updated 4 years ago
- BERT Tokenizer in C++☆77Updated 4 years ago
- Service package for rasa_chinese☆18Updated 4 years ago
- BERT implemented in pure C++☆36Updated 5 years ago
- KuaiSearch PERKS☆12Updated 3 years ago
- Lightweight deep learning inference service framework☆40Updated 4 years ago
- This repo contains a PyTorch implementation of a pretrained ERNIE model for text classification.☆59Updated 2 years ago
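- A BERT-based pretrained language model implementation in two steps: pretraining and fine-tuning. Currently includes three models (BERT, Roberta, ALbert), all of which support Whole Word Mask mode.☆17Updated 5 years ago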
- Chinese version of reformer-pytorch, a simple and efficient generative model with results similar to GPT2☆16Updated 2 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Updated 4 years ago
- ☢️ TensorRT 2023 second round: inference acceleration and optimization of the Llama model based on TensorRT-LLM☆50Updated last year
- Python interface for Facebook faiss☆15Updated 5 years ago
- A complete closed-loop workflow for NER, from offline training to online deployment☆14Updated 4 years ago
- Offline on-device reading comprehension application: QA for mobile, Android & iPhone☆60Updated 2 years ago
- Deploy DL/ML inference pipelines with minimal extra code.☆99Updated 9 months ago
- ☆16Updated 5 years ago
- ☆15Updated 4 years ago
- Neural Network based Chinese Segmentation System☆18Updated 8 years ago
- A simple PyTorch implementation of vector representation methods for Chinese text (Sentence-BERT, CoSENT), usable for text similarity computation.☆10Updated 3 years ago
- Finetune CPM-1☆24Updated 4 years ago
- Accelerated deployment of Bert models with TensorRT☆10Updated 3 years ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆49Updated 2 years ago
- Global AI Technology Innovation Competition, Track 3: semantic matching of short dialogue texts for the Xiaobu assistant☆12Updated 4 years ago
- This is a Chinese Bert model specific to question answering☆27Updated 6 years ago
- NMT model with BERT in TensorFlow 2.0☆20Updated 6 years ago
- bert-of-theseus via bert4keras☆31Updated 5 years ago
- Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging☆35Updated 6 years ago
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD"☆12Updated 5 years ago