jdeng / mmseg
MMSEG simple word segmenter in C++ 11
☆17Updated 10 years ago
Alternatives and similar repositories for mmseg:
Users that are interested in mmseg are comparing it to the libraries listed below
- mmseg 分词算法c++实现☆33Updated 9 years ago
- wrap cppjieba by swig.☆20Updated 7 years ago
- ctbparser是一个用C++语言实现的开源的中文处理工具包(GBK编码),用于分词、词性标注、依存句法分析,采用的是中文宾州树库(Chinese Tree Bank, CTB)标准。☆12Updated 10 years ago
- a Chinese tokenizer☆17Updated 11 years ago
- C++ headers(hpp) library with Python style.☆131Updated 3 months ago
- CppJieba的C语言api☆56Updated 2 years ago
- Chinese processing☆36Updated 11 years ago
- MIT Language Modeling Toolkit☆116Updated 5 years ago
- auto generate chinese words in huge text.☆91Updated 10 years ago
- ☆59Updated 8 months ago
- NanGe - A Rule-based Chinese-English Machine Translation System☆20Updated 7 years ago
- An effective C/C++ compute frame☆50Updated 9 years ago
- A clone of Darts (Double-ARray Trie System)☆145Updated 6 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆31Updated 7 years ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 8 years ago
- 微型中文关键词抽取服务☆55Updated 7 years ago
- CppJieba 的 Nginx 扩展模块☆10Updated 10 years ago
- an open solution for collecting n-gram Chinese lexicon and n-gram statistics☆74Updated 9 years ago
- Read-only unofficial mirror of the OpenGrm NGram Library☆8Updated 5 years ago
- A TensorFlow implementation of WaveGlow☆8Updated 6 years ago
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.☆13Updated 9 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- Google word2vec tools built for windows compiled with visual studio 2017 and dev c++ on Windows 10 x64.☆14Updated 7 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆62Updated 9 years ago
- A single file C++17 header-only Minimal Acyclic Subsequential Transducers, or Finite State Transducers☆55Updated 2 years ago
- Scala implementation of Aho-Corasick algorithm☆15Updated 2 years ago
- 基于字典树的简单脏词匹配替换类 A trie-tree-based dirty word checker for c++.☆26Updated 9 years ago
- simple http service framework☆18Updated 8 months ago
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- "结巴"中文分词的C++版本,使用 darts Double Array Trie 降低内存占用到 1/100☆49Updated 2 years ago