po3rin / sudachi-elasticsearch-sampleLinks
Sample of Elasticsearch Docker using Sudachi analizer
☆10Updated 2 years ago
Alternatives and similar repositories for sudachi-elasticsearch-sample
Users that are interested in sudachi-elasticsearch-sample are comparing it to the libraries listed below
Sorting:
- ☆51Updated 2 years ago
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆249Updated 3 weeks ago
- 『機械学習による検索ランキング改善ガイド』のサンプルコードのリポジトリ☆22Updated 2 years ago
- ☆27Updated last year
- Japanese Morphological Analyzer written in Rust☆107Updated last week
- 原論文から解き明かす生成AI(技術評論社)のサポートページです☆84Updated 3 weeks ago
- 法律・判例関係のデータセット☆45Updated last year
- optpy is a transpiler to generate a Rust file from a Python file☆25Updated 3 years ago
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Updated 3 years ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆12Updated 11 months ago
- Japanese synonym library☆55Updated 3 years ago
- ☆23Updated 11 months ago
- Finding all pairs of similar documents time- and memory-efficiently☆62Updated 9 months ago
- Japanese semantic test suite (FraCaS counterpart and extensions)☆13Updated last year
- rust + lindera + webassembly + next.js + typescriptで形態素解析するサンプル☆41Updated 5 years ago
- Making Law Easy☆64Updated 3 months ago
- ☆16Updated last year
- ☆44Updated 4 months ago
- ☆11Updated last year
- 最小のサーチエンジン/PageRank/tf-idf☆19Updated 2 years ago
- ☆88Updated 2 years ago
- Arguments parser with class for Python, inspired by StructOpt☆62Updated 2 years ago
- Testing tool to verify the search qualities of the Elasticsearch indices☆29Updated 3 years ago
- デジタル化資料OCRテキスト化事業において作成されたOCR学習用データセット☆74Updated last year
- ☆25Updated 3 years ago
- 自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器☆140Updated 10 months ago
- DistilBERT model pre-trained on 131 GB of Japanese web text. The teacher model is BERT-base that built in-house at LINE.☆46Updated 2 years ago
- Recording Composition Tool Hisui☆22Updated last week
- DuckDB-Wasm (FTS 拡張) + Lindera-Wasm☆36Updated 3 months ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆81Updated last week