ericperfect / libtorch_tokenizer
BERT Tokenizer in C++
☆76Updated 4 years ago
Alternatives and similar repositories for libtorch_tokenizer:
Users that are interested in libtorch_tokenizer are comparing it to the libraries listed below
- 高性能文本 Tokenizer 库☆28Updated last year
- Minimal example of using a traced huggingface transformers model with libtorch☆35Updated 4 years ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆17Updated 3 years ago
- lightweighted deep learning inference service framework☆39Updated 3 years ago
- implement bert in pure c++☆36Updated 4 years ago
- 中文版unilm预训练模型☆83Updated 4 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Updated 2 years ago
- 不用tensorflow estimator,分别采用字mask和wwm mask在中文领域内finetune bert模型☆23Updated 5 years ago
- 一个非常高效的字符串匹配工具,支持正向/反向最大匹配分词和多模式字符串精确匹配☆17Updated last year
- 基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型,pinyin correcter,基于pytorch框架实现☆45Updated 4 years ago
- ☆102Updated 4 years ago
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆75Updated 2 years ago
- tensorflow version of bert-of-theseus☆62Updated 4 years ago
- ☆90Updated last year
- Chinese MobileBERT(中文MobileBERT模型)☆90Updated 3 years ago
- 对话改写介绍文章☆97Updated last year
- C++ model train&inference framework☆224Updated 5 years ago
- 长文本相似度模型☆20Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- R-Drop方法在中文任务上的简单实验☆91Updated 3 years ago
- ☆52Updated 3 years ago
- TensorRT☆11Updated 4 years ago
- 大规模中文语料☆41Updated 5 years ago
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆72Updated last year
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆40Updated 3 years ago
- We start a company-name recognition task with a small scale and low quality training data, then using skills to enhanced model training s…☆80Updated 4 years ago
- ☆51Updated 4 years ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- ☆71Updated 2 years ago