ikegami-yukino / mecab-as-kkc
Converting Mozc dictionary to MeCab dictionary for Kana-Kanji conversion (KKC)
☆13Updated 6 months ago
Alternatives and similar repositories for mecab-as-kkc:
Users that are interested in mecab-as-kkc are comparing it to the libraries listed below
- ☆10Updated 7 years ago
- A paraphrase database for Japanese text simplification☆32Updated 7 years ago
- Japanese synonym library☆53Updated 3 years ago
- Python で全角・半角・ひらがな・カタカナ等を変換する☆17Updated 8 years ago
- GPU state check script.☆23Updated last year
- DistilBERT model pre-trained on 131 GB of Japanese web text. The teacher model is BERT-base that built in-house at LINE.☆44Updated last year
- A localized word dictionary asset for University of Tsukuba☆10Updated 2 years ago
- This is the repository for TRF (text readability features) publication.☆39Updated 5 years ago
- Testing tool to verify the search qualities of the Elasticsearch indices☆29Updated 2 years ago
- ☆16Updated 5 years ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆22Updated 10 months ago
- 最小のサーチエンジン/PageRank/tf-idf☆19Updated last year
- ☆71Updated 6 years ago
- Viterbi-based accelerated tokenizer (Python wrapper)☆41Updated 5 months ago
- Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)☆76Updated last year
- Machine learning tasks which are used with data pipeline library "luigi" and its wrapper "gokart".☆43Updated last year
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆17Updated 2 years ago
- ☆25Updated 3 months ago
- PythonとCythonで出来てる日本語形態素解析エンジン🚧☆13Updated 5 years ago
- Show notes for https://anchor.fm/yoheikikuta.☆15Updated 2 years ago
- japanese sentence segmentation library for python☆70Updated last year
- 専門用語抽出アルゴリズムの実装の練習☆18Updated 6 years ago
- Japanese semantic test suite (FraCaS counterpart and extensions)☆13Updated 3 months ago
- 📙UNICODE絵文字の日本語読み/キーワード/分類辞書📙☆79Updated last year
- 多次元配列の属性情報を実行時に取得してコメントに書き足すツール☆68Updated last year
- Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)☆24Updated last year
- Implementation in order to operate a web API of word vector models which are generated by Word2Vec, GloVe or e.t.c.☆43Updated 9 years ago
- Python implementation of SWEM (Simple Word-Embedding-based Methods)☆29Updated 2 years ago
- ベイズ階層言語モデルによる教師なし形態素解析☆33Updated last year
- nishika akutagawa compedition 2nd prize : https://www.nishika.com/competitions/1/summary☆26Updated 4 years ago