ikawaha / kagome-dict
Dictionary Library for Kagome v2
☆11Updated last week
Alternatives and similar repositories for kagome-dict:
Users that are interested in kagome-dict are comparing it to the libraries listed below
- Safe Rust bindings for mecab, a part-of-speech and morphological analyzer library☆61Updated last year
- Sample code repository for the book "Guide to Improving Search Ranking with Machine Learning" (『機械学習による検索ランキング改善ガイド』)☆19Updated last year
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆233Updated this week
- Yada is yet another double-array trie library aiming for fast search and compact data representation.☆34Updated 11 months ago
- Testing tool to verify the search quality of Elasticsearch indices☆29Updated 2 years ago
- Japanese synonym library☆53Updated 3 years ago
- ☆29Updated 3 years ago
- DistilBERT model pre-trained on 131 GB of Japanese web text. The teacher model is a BERT-base that was built in-house at LINE.☆44Updated last year
- IPAdic packaged for easy use from Python.☆25Updated 3 years ago
- Datasets related to law and court precedents☆33Updated last month
- A Japanese dependency parser based on BERT☆22Updated 2 years ago
- ☆10Updated 7 years ago
- Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)☆76Updated last year
- This library provides features such as conversion between, and detection of, hiragana and katakana and half-width and full-width characters.☆19Updated last month
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆17Updated 2 years ago
- ☆20Updated 3 weeks ago
- ☆96Updated last year
- Namelti: an automatic transcription generation library for person names in Katakana☆21Updated last year
- Japanese tokenizer for Transformers☆80Updated last year
- A minimal search engine / PageRank / tf-idf☆19Updated last year
- SQL linter tool for BigQuery GoogleSQL (formerly known as StandardSQL).☆17Updated 4 months ago
- Neologism dictionary based on the language resources on the Web for mecab-unidic☆84Updated 4 years ago
- Sample of morphological analysis with rust + lindera + webassembly + next.js + typescript☆41Updated 4 years ago
- 🦞 Rust library of natural language dictionaries using character-wise double-array tries.☆29Updated last month
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆47Updated 2 months ago
- Japanese sentence segmentation library for Python☆70Updated last year
- Kannon is a wrapper for the gokart library that allows gokart tasks to be easily executed in a distributed and parallel manner on multipl…☆25Updated last month
- Japanese edition of Asynchronous Programming in Rust☆14Updated 2 years ago
- Japanese Morphological Analyzer written in Rust☆96Updated last month
- [WIP] Twitter Client Library written in Rust☆49Updated 3 years ago