byronhe / cppjieba
"结巴"中文分词的C++版本,使用 darts Double Array Trie 降低内存占用到 1/100
☆48Updated 2 years ago
Alternatives and similar repositories for cppjieba:
Users that are interested in cppjieba are comparing it to the libraries listed below
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆15Updated 2 years ago
- 词语拼音数据☆462Updated last week
- A clone of Darts (Double-ARray Trie System)☆143Updated 6 years ago
- BERT Tokenizer in C++☆75Updated 4 years ago
- The simple header file library of CppJieba☆40Updated 9 years ago
- CppJieba的C语言api☆56Updated 2 years ago
- mmseg 分词算法c++实现☆33Updated 9 years ago
- Somiao Pinyin: Train your own Chinese Input Method with Seq2seq Model 搜喵拼音输入法☆267Updated 4 years ago
- C++ model train&inference framework☆223Updated 5 years ago
- [本项目不再维护] 将汉字转换为拼音, 支持多音字,拼音 -> pin yin☆207Updated last year
- KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite☆93Updated 2 years ago
- a kws demo on android☆36Updated 7 months ago
- C++ headers(hpp) library with Python style.☆130Updated 2 weeks ago
- 从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库☆21Updated 3 years ago
- 高性能文本 Tokenizer 库☆28Updated 11 months ago
- 对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字☆445Updated 9 months ago
- wrap cppjieba by swig.☆17Updated 6 years ago
- A library for adding punctuation into a text from ASR.☆16Updated last year
- simple-pinyin 基于隐马尔可夫模型的简易拼音输入法(拼音转汉字)☆46Updated 4 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆31Updated last year
- 使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的内置拼音数据☆51Updated last week
- A c++ client library for redis cluster.☆12Updated 8 years ago
- Real time vector search engine☆139Updated last year
- 基于 g2pW 提升 pypinyin 的准确性☆82Updated last year
- An Efficient Lexical Analyzer for Chinese☆39Updated 5 years ago
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆119Updated 5 years ago
- TTS inference in C++ based on TFlite model☆18Updated 4 years ago
- A simple TTS(text-to-speech) engine for Chinese mandarin☆19Updated 12 years ago
- 中文标点符号模型,可以给文本添加标点符号。☆134Updated 3 weeks ago