SamuraiT / tinysegmenter
Tokenizer specialized for Japanese
☆49 Updated 3 years ago
Alternatives and similar repositories for tinysegmenter:
Users that are interested in tinysegmenter are comparing it to the libraries listed below
- A fast converter between Japanese hankaku and zenkaku characters ☆145 Updated last year
- ☆97 Updated 6 years ago
- Neologism dictionary based on the language resources on the Web for mecab-unidic ☆85 Updated 4 years ago
- A comparison tool of Japanese tokenizers ☆121 Updated 9 months ago
- CaboCha wrapper for Python3 ☆47 Updated 6 years ago
- Word List by Semantic Principles (WLSP): “It is a collection of words classified and arranged by their meanings” ☆53 Updated 4 years ago
- A Python Module for JUMAN++/KNP ☆89 Updated 2 weeks ago
- Mozc for Python: Kana-Kanji converter ☆44 Updated last month
- text-only archives of www.aozora.gr.jp ☆78 Updated 2 years ago
- A tool for building gensim word2vec model for Japanese. ☆93 Updated 8 years ago
- Kanjize(カンジャイズ): Easy converter between Kanji-Number and Integer ☆60 Updated last month
- Google Chrome Extension: Display "English Ruby(RUBI)" in Japanese. ☆45 Updated 7 years ago
- Yet Another Japanese Dependency Structure Analyzer ☆111 Updated last month
- A paraphrase database for Japanese text simplification ☆32 Updated 8 years ago
- Python version of the Japanese semantic role labeling system (ASA) ☆23 Updated 2 years ago
- Rakuten MA (Python version) ☆22 Updated 7 years ago
- Japanese sentence segmentation library for Python ☆70 Updated last year
- Python 3 port of extractcontent.rb, which extracts the main text from HTML ☆23 Updated 5 years ago
- Solr / Elasticsearch Synonym mapping file for Japanese web documents using results of NEologd ☆39 Updated 9 years ago
- Web API for Japanese morphological analysis using MeCab ☆40 Updated last week
- Convert between full-width, half-width, hiragana, katakana, etc. in Python ☆17 Updated 8 years ago
- Laboro BERT Japanese: Japanese BERT Pre-Trained With Web-Corpus ☆73 Updated 2 years ago
- Tokyo Metropolitan University Japanese Twitter corpus ☆21 Updated 9 years ago
- RESTful MeCab on Docker ☆50 Updated 6 years ago
- A lexicon for Sudachi ☆245 Updated last month
- hottoSNS-w2v: word embedding model trained on a large-scale Japanese SNS + Web corpus ☆60 Updated 3 months ago
- English-Japanese dictionary ☆61 Updated 7 years ago
- 📙 Dictionary of Japanese readings, keywords, and categories for Unicode emoji 📙 ☆79 Updated last year
- Flatten nested iterable object for Python (Pure-Python implementation) ☆28 Updated 5 years ago
- Implementation of a web API for serving word vector models generated by Word2Vec, GloVe, etc. ☆43 Updated 9 years ago