The Japanese analysis plugin for elasticsearch
☆220Jun 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for elasticsearch-sudachi
Users that are interested in elasticsearch-sudachi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A lexicon for Sudachi☆299Apr 30, 2026Updated last month
- A Japanese Tokenizer for Business☆977May 26, 2026Updated 3 weeks ago
- Japanese synonym library☆11Apr 18, 2022Updated 4 years ago
- Japanese synonym library☆55Feb 7, 2022Updated 4 years ago
- Awesome List of Sources of Japanese Censored Words☆19Sep 11, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Funer is Rule based Named Entity Recognition tool.☆22Apr 21, 2022Updated 4 years ago
- Sudachi in Rust 🦀 and new generation of SudachiPy☆448Jun 2, 2026Updated 2 weeks ago
- Python version of Sudachi, a Japanese tokenizer.☆439Oct 7, 2022Updated 3 years ago
- Elasticsearch's Analyzer for Kuromoji with Neologd☆115Nov 22, 2023Updated 2 years ago
- 「仕事ではじめる検索システム」という本があったなら,という想像の産物です -> 「検索システム ― 実務者のための開発改善ガイドブック」になりました☆144May 16, 2022Updated 4 years ago
- ☆99Jul 23, 2023Updated 2 years ago
- ☆11Jun 17, 2024Updated 2 years ago
- kuro2sudachi lets you to convert kuromoji user dict to sudachi user dict.☆11Apr 26, 2025Updated last year
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Jun 9, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)☆199Mar 26, 2024Updated 2 years ago
- Japanese tokenizer for Transformers☆79Dec 15, 2023Updated 2 years ago
- A Japanese NLP Library using spaCy as framework based on Universal Dependencies☆855Mar 30, 2024Updated 2 years ago
- ISUNARABE の練習 VM 用イメージをビルドするためのパイプライン☆11Mar 20, 2024Updated 2 years ago
- Code for COLING 2020 Paper☆13Feb 3, 2026Updated 4 months ago
- A multilingual morphological analysis library.☆633Updated this week
- 日本十進分類法のIME辞書☆11Dec 8, 2022Updated 3 years ago
- Neologism dictionary based on the language resources on the Web for mecab-ipadic☆2,785Dec 27, 2023Updated 2 years ago
- 日本語テキストに対する wikification のためのソフトウェア☆17Mar 14, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Use custom tokenizers in spacy-transformers☆16Aug 9, 2022Updated 3 years ago
- ☆74Aug 3, 2025Updated 10 months ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆90Mar 16, 2026Updated 3 months ago
- japanese sentence segmentation library for python☆74Apr 3, 2023Updated 3 years ago
- 🦞 Rust library of natural language dictionaries using character-wise double-array tries.☆38Jan 13, 2025Updated last year
- Yada is a yet another double-array trie library aiming for fast search and compact data representation.☆48Jun 7, 2026Updated last week
- The full-text search system for Aozora Bunko by Groonga. 青空文庫全文検索ライブラリ兼Webアプリ。☆22Jun 9, 2026Updated last week
- Japanese word embedding with Sudachi and NWJC 🌿☆175Mar 1, 2024Updated 2 years ago
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆289Feb 7, 2026Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Kawaii Chatbot Plugin for BotBone☆16Dec 8, 2022Updated 3 years ago
- 『機械学習による検索ランキング改善ガイド』のサンプルコードのリポジトリ☆23Aug 3, 2023Updated 2 years ago
- ☆10Jan 12, 2018Updated 8 years ago
- Namelti : The automatic transcription generation library for person name in Katakana☆22Jul 10, 2023Updated 2 years ago
- Japanese text normalizer for mecab-neologd☆289May 6, 2026Updated last month
- textlint rule plugin to check duplicated conjunctive particle `ga` in a sentence.☆11Nov 26, 2023Updated 2 years ago
- Japanese Morphological Analysis written in Rust☆84Dec 30, 2021Updated 4 years ago