ericperfect / libtorch_tokenizer
BERT Tokenizer in C++
☆76 Updated 4 years ago
Alternatives and similar repositories for libtorch_tokenizer
Users who are interested in libtorch_tokenizer are comparing it to the libraries listed below.
- High-performance text tokenizer library ☆29 Updated last year
- C++ model training and inference framework ☆223 Updated 5 years ago
- BERT implemented in pure C++ ☆35 Updated 5 years ago
- Minimal example of using a traced huggingface transformers model with libtorch ☆35 Updated 4 years ago
- Lightweight deep learning inference service framework ☆39 Updated 4 years ago
- Chinese MobileBERT model ☆94 Updated 3 years ago
- Offline reading-comprehension app: QA for mobile, Android & iPhone ☆60 Updated 2 years ago
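- A highly efficient string-matching tool that supports forward/backward maximum-matching word segmentation and exact multi-pattern string matching ☆17 Updated last year
- Python | Efficient use of the statistical language model kenlm: new-word discovery, word segmentation, intelligent error correction, and more ☆165 Updated 5 years ago
- Fine-tunes a BERT model on Chinese in-domain data without the TensorFlow Estimator, using character-level masking and whole-word masking (wwm) respectively ☆23 Updated 5 years ago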
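- CTC2021: SOTA solution and online demo for the Chinese text correction competition ☆72 Updated 2 years ago
- Chinese UniLM pretrained model ☆83 Updated 4 years ago
- MLM-based BERT pretrained model for pinyin-to-Chinese-character conversion with error correction (pinyin corrector), implemented in PyTorch ☆45 Updated 4 years ago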
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP) ☆17 Updated 3 years ago
- ☆101 Updated 4 years ago
- Large-scale Chinese corpus ☆42 Updated 5 years ago
- Chinese text correction based on seq2edit (Gector). ☆29 Updated 2 years ago
- Python toolkit for Chinese Language Understanding (CLUE) Evaluation benchmark ☆130 Updated 2 years ago
- TensorRT ☆11 Updated 4 years ago
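- Upgraded version of RoFormer ☆153 Updated 2 years ago
- Baseline for the Intelligent Text Proofreading competition (Chinese Text Correction) ☆67 Updated 2 years ago
- Chinese character feature extraction tool: extracts pronunciation (initial, final, tone), glyph form (components, radicals), four-corner code, and other features, which can also be fed to a model as tensors ☆137 Updated 5 years ago
- Chinese text correction based on BERT ☆235 Updated 2 years ago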
- ☆90 Updated 2 years ago
- ☆52 Updated 4 years ago
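- Extracts an error-correction corpus from Wikipedia edit-history data. ☆12 Updated 3 years ago
- A pretraining-based sentence-embedding generation tool ☆137 Updated 2 years ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning ☆47 Updated last year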
- TensorFlow version of bert-of-theseus ☆62 Updated 4 years ago
- Time expression extraction, parsing, and normalization tool ☆53 Updated 2 years ago