ueda-keisuke / CC-CEDICT-MeCab
CC-CEDICT-MeCab is a MeCab dictionary for Chinese (Mandarin) text segmentation
☆10Updated 4 years ago
Related projects
Alternatives and complementary repositories for CC-CEDICT-MeCab
- 🦞 Rust library of natural language dictionaries using character-wise double-array tries.☆28Updated last year
- A tool for visualizing the internal structures of the morphological analyzer Sudachi☆17Updated 2 years ago
- Yada is yet another double-array trie library aiming for fast search and compact data representation.☆31Updated 8 months ago
- ☆21Updated 2 weeks ago
- A Japanese Morphological Analyzer written in pure Rust☆26Updated 5 years ago
- Japanese tokenizer for Rust☆34Updated 5 years ago
- Python version of the Japanese semantic role labeling system (ASA)☆23Updated 2 years ago
- A Lindera Japanese tokenizer wrapper for JavaScript and TypeScript☆14Updated 2 years ago
- ☆15Updated 11 months ago
- Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)☆23Updated 9 months ago
- Yet another sentence-level tokenizer for Japanese text☆22Updated 2 years ago
- sqlite3 fts5 mecab☆17Updated 5 years ago
- Japanese synonym library☆52Updated 2 years ago
- Finding all pairs of similar documents time- and memory-efficiently☆58Updated 2 years ago
- Rakuten MA (Python version)☆22Updated 7 years ago
- Unidic packaged for installation via pip.☆77Updated last year
- Safe Rust bindings for MeCab, a part-of-speech and morphological analyzer library☆57Updated last year
- Rust implementation of SIF and uSIF: Simple and fast sentence embedding☆19Updated 11 months ago
- Lindera tokenizer for Tantivy.☆54Updated 4 months ago
- A Japanese law parser☆12Updated 9 months ago
- A Japanese dependency parser based on BERT☆22Updated 2 years ago
- Japanese text preprocessor for Text-to-Speech applications (OpenJTalk rewrite in Rust)☆35Updated this week
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆229Updated this week
- A small version of UniDic for easy pip installs.☆38Updated 4 years ago
- ☆10Updated 3 weeks ago
- Annotated Fuman Kaitori Center Corpus☆17Updated 10 months ago
- Code for COLING 2020 Paper☆13Updated last week
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Updated 4 years ago
- ☆46Updated last year
- Japanese sentence segmentation library for Python☆68Updated last year