ericperfect / libtorch_tokenizerLinks

BERT Tokenizer in C++

☆77

Alternatives and similar repositories for libtorch_tokenizer

Users that are interested in libtorch_tokenizer are comparing it to the libraries listed below

Sorting:

LieluoboAi / radish
C++ model train&inference framework
☆223Updated 5 years ago
zejunwang1 / easytokenizer
高性能文本 Tokenizer 库
☆29Updated last year
LeeJuly30 / BERTCpp
implement bert in pure c++
☆35Updated 5 years ago
thuwyh / InferLight
lightweighted deep learning inference service framework
☆39Updated 4 years ago
Peter-Chou / transformer_cpp_tokenizers
transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)
☆17Updated 3 years ago
dhpollack / huggingface_libtorch
Minimal example of using a traced huggingface transformers model with libtorch
☆35Updated 4 years ago
mattzheng / py-kenlm-model
python | 高效使用统计语言模型kenlm：新词发现、分词、智能纠错等
☆166Updated 5 years ago
ymcui / Chinese-MobileBERT
Chinese MobileBERT（中文MobileBERT模型）
☆95Updated 3 years ago
CLUEbenchmark / MobileQA
离线端阅读理解应用 QA for mobile, Android & iPhone
☆60Updated 2 years ago
yanqiuxia / BERT-PreTrain
不用tensorflow estimator，分别采用字mask和wwm mask在中文领域内finetune bert模型
☆23Updated 5 years ago
Mitomzhou / ASRT_SR_tensorflow2.0
基于深度学习识别THCHS30数据集
☆14Updated 3 years ago
pluto-junzeng / C4-zh
大规模中文语料
☆43Updated 5 years ago
taishan1994 / Gector_chinese
基于seq2edit (Gector) 的中文文本纠错。
☆29Updated 2 years ago
Tlntin / ChatGLM2-6B-TensorRT
☆90Updated 2 years ago
tongchangD / bert_for_corrector
基于bert进行中文文本纠错
☆235Updated 2 years ago
zejunwang1 / bert4vec
一个基于预训练的句向量生成工具
☆137Updated 2 years ago
zhongerqiandan / pretrained-unilm-Chinese
中文版unilm预训练模型
☆82Updated 4 years ago
zejunwang1 / darmatch
一个非常高效的字符串匹配工具，支持正向/反向最大匹配分词和多模式字符串精确匹配
☆17Updated 2 years ago
CLUEbenchmark / PyCLUE
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
☆131Updated 2 years ago
bojone / nezha_gpt_dialog
☆101Updated 4 years ago
ZhuiyiTechnology / roformer-v2
RoFormer升级版
☆153Updated 2 years ago
zhusleep / tagger_rewriter
对话改写介绍文章
☆97Updated 2 years ago
charlesXu86 / char_featurizer
汉字字符特征提取工具，可以提取出字符中的字音（声母、韵母、声调）、字形（偏旁、部首）、四角编码等特征，同时可作为tensor输入到模型
☆136Updated 5 years ago
liushulinle / PLOME
Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021
☆237Updated 2 years ago
Lisennlp / TinyBert
简洁易用版TinyBert：基于Bert进行知识蒸馏的预训练语言模型
☆266Updated 4 years ago
jiangtaojy / mlm_bert_traning
基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型，pinyin correcter，基于pytorch框架实现
☆45Updated 4 years ago
renatoviolin / BERT-cpp-inference
☆52Updated 4 years ago
BshoterJ / Text-Matching
This repo contains some experiments of text matching on Chinese dataset LCQMC
☆27Updated 5 years ago
gitabtion / SoftMaskedBert-PyTorch
🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers.
☆95Updated 4 years ago
whgaara / tensorflow-faspell
☆23Updated 4 years ago