The Japanese analysis plugin for Elasticsearch
☆219Feb 27, 2026Updated last week
Alternatives and similar repositories for elasticsearch-sudachi
Users that are interested in elasticsearch-sudachi are comparing it to the libraries listed below
- A Japanese Tokenizer for Business☆942Jun 17, 2025Updated 8 months ago
- A lexicon for Sudachi☆280Jan 20, 2026Updated last month
- Japanese synonym library☆11Apr 18, 2022Updated 3 years ago
- Japanese synonym library☆55Feb 7, 2022Updated 4 years ago
- Awesome List of Sources of Japanese Censored Words☆19Sep 11, 2022Updated 3 years ago
- ☆100Jul 23, 2023Updated 2 years ago
- Sudachi in Rust 🦀 and new generation of SudachiPy☆428Updated this week
- Elasticsearch's Analyzer for Kuromoji with Neologd☆114Nov 22, 2023Updated 2 years ago
- "Open Book Camera", a high-speed book cover photography system☆23Apr 29, 2023Updated 2 years ago
- Funer is a rule-based Named Entity Recognition tool.☆22Apr 21, 2022Updated 3 years ago
- Code for COLING 2020 Paper☆13Feb 3, 2026Updated last month
- Sentence boundary disambiguation tool for Japanese texts☆199Mar 26, 2024Updated last year
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Jun 9, 2022Updated 3 years ago
- Python version of Sudachi, a Japanese tokenizer.☆428Oct 7, 2022Updated 3 years ago
- Software for wikification of Japanese text☆17Mar 14, 2017Updated 8 years ago
- Japanese tokenizer for Transformers☆79Dec 15, 2023Updated 2 years ago
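- An imagining of what a book titled 「仕事ではじめる検索システム」 ("Getting Started with Search Systems at Work") would look like; it has since become the book 「検索システム ― 実務者のための開発改善ガイドブック」 ("Search Systems: A Development and Improvement Guidebook for Practitioners")☆142May 16, 2022Updated 3 years ago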
- Use custom tokenizers in spacy-transformers☆16Aug 9, 2022Updated 3 years ago
- A Japanese NLP Library using spaCy as framework based on Universal Dependencies☆834Mar 30, 2024Updated last year
- Practice implementations of technical term extraction algorithms☆18Sep 26, 2018Updated 7 years ago
- The full-text search system for Aozora Bunko by Groonga. A full-text search library and web app for Aozora Bunko.☆21Mar 8, 2023Updated 3 years ago
- Toy File System, currently using FUSE.☆10Jun 1, 2019Updated 6 years ago
- Sample of Elasticsearch Docker using Sudachi analyzer☆10Sep 18, 2023Updated 2 years ago
- Tokyo Metropolitan University Paraphrase Corpus (TMUP)☆11Jun 12, 2017Updated 8 years ago
- A multilingual morphological analysis library.☆606Feb 27, 2026Updated last week
- ☆11Jun 17, 2024Updated last year
- kuro2sudachi lets you convert a kuromoji user dict to a sudachi user dict.☆11Apr 26, 2025Updated 10 months ago
- ☆10Jan 12, 2018Updated 8 years ago
- Repository of sample code for the book 『機械学習による検索ランキング改善ガイド』 ("A Guide to Improving Search Ranking with Machine Learning")☆22Aug 3, 2023Updated 2 years ago
- Neologism dictionary based on the language resources on the Web for mecab-ipadic☆2,788Dec 27, 2023Updated 2 years ago
- Namelti: an automatic transcription generation library for person names in Katakana☆21Jul 10, 2023Updated 2 years ago
- Japanese sentence segmentation library for Python☆73Apr 3, 2023Updated 2 years ago
- ☆73Aug 3, 2025Updated 7 months ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆87Updated this week
- 🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.☆261Mar 1, 2026Updated last week
- notmecab-rs is a very basic mecab clone, designed only to do parsing, not training.☆18Jul 25, 2020Updated 5 years ago
- ☆13Apr 23, 2017Updated 8 years ago
- Yada is yet another double-array trie library aiming for fast search and compact data representation.☆45Feb 25, 2024Updated 2 years ago
- RedPen custom validator☆14Sep 19, 2016Updated 9 years ago