byronhe / cppjieba
C++ version of the "Jieba" Chinese word segmenter, using the darts Double Array Trie to reduce memory usage to 1/100
☆53 Updated 3 years ago
Alternatives and similar repositories for cppjieba
Users who are interested in cppjieba are comparing it to the libraries listed below.
- Transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP) ☆18 Updated 3 years ago
- Edge Machine Learning Library ☆199 Updated 3 years ago
- KSAI Lite is a deep learning inference framework from Kingsoft, based on TensorFlow Lite ☆96 Updated 3 years ago
- A clone of Darts (Double-ARray Trie System) ☆157 Updated 8 months ago
- ☆102 Updated 3 years ago
- Pinyin data for words and phrases ☆510 Updated 6 months ago
- BERT Tokenizer in C++ ☆79 Updated 5 years ago
- A Chinese tokenizer ☆18 Updated 12 years ago
- A simple header-only library version of CppJieba ☆41 Updated 10 years ago
- Onnxruntime Builder ☆69 Updated 3 months ago
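- The best tool for converting Chinese numerals to Arabic digits. Handles many Chinese expressions such as "点二八" (point two eight) and "负百分之四十" (negative forty percent). A must-have for NLP and robot engineering! ☆371 Updated 2 years ago
- C API for CppJieba ☆60 Updated 3 years ago
- Chinese punctuation model that can add punctuation to text. ☆147 Updated last year
- Somiao Pinyin (搜喵拼音输入法): Train your own Chinese input method with a Seq2seq model ☆274 Updated 5 years ago
- [No longer maintained] Converts Chinese characters to pinyin, with support for polyphonic characters. 拼音 -> pin yin ☆211 Updated 8 months ago
- High-performance text tokenizer library ☆32 Updated last year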
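- C++ headers (hpp) library with Python style. ☆139 Updated 5 months ago
- A C++ ASR inference project with few dependencies, simple installation, and fast inference; it also runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model, and the dataset is the open-source wenetspeech (10000+ hours) or Alibaba's private dataset (60000+… ☆545 Updated 2 years ago
- C++ model training and inference framework ☆223 Updated 6 years ago
- Port of FunASR's Paraformer model in C/C++ ☆39 Updated last year
- A lightweight speech recognition decoding and inference framework trimmed down from Kaldi; currently implements MFCC+GMM+Viterbi, with no dependency on libraries such as OpenFST or OpenBLAS ☆22 Updated 4 years ago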
- onnxruntime pre-compiled libs ☆168 Updated 2 weeks ago
- Efficient inference of large language models. ☆149 Updated 4 months ago
- ☆127 Updated 4 years ago
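- 《声纹技术:从核心算法到工程实践》 (Voiceprint Technology: From Core Algorithms to Engineering Practice) ☆175 Updated 3 years ago
- The pkuseg toolkit for multi-domain Chinese word segmentation ☆69 Updated 6 months ago
- Pinyin to Chinese characters; a pinyin input method engine. pin yin -> 拼音 ☆628 Updated 8 months ago
- Compares the sound and shape of 6,700 commonly used Chinese characters and outputs lists of similar-sounding and similar-looking characters. # 相近字 (similar characters) ☆481 Updated last year
- Chatbot, natural language understanding, semantic understanding ☆409 Updated 2 years ago
- A demo of a zh/Chinese text-to-speech system that runs on CPU in real time. ☆181 Updated 3 years ago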