AUGMXNT / shisaView external linksLinks
☆42Mar 30, 2024Updated last year
Alternatives and similar repositories for shisa
Users that are interested in shisa are comparing it to the libraries listed below
Sorting:
- ☆24Dec 15, 2023Updated 2 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated last year
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 7 months ago
- Easily turn large English text datasets into Japanese text datasets using open LLMs.☆25Jan 20, 2025Updated last year
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆124Nov 13, 2025Updated 3 months ago
- Japanese instruction data (日本語指示データ)☆24Jul 13, 2023Updated 2 years ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆25Mar 27, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆154Sep 13, 2024Updated last year
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Nov 21, 2020Updated 5 years ago
- Preferred Generation Benchmark☆91Oct 28, 2025Updated 3 months ago
- ☆16Mar 4, 2024Updated last year
- ☆33Jul 31, 2024Updated last year
- ☆17Sep 29, 2024Updated last year
- ☆19Dec 6, 2024Updated last year
- ☆16Apr 11, 2024Updated last year
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆38Oct 7, 2025Updated 4 months ago
- JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット☆42Sep 9, 2025Updated 5 months ago
- ☆16Nov 19, 2023Updated 2 years ago
- ☆41Apr 10, 2025Updated 10 months ago
- Swallowプロジェクト 事後学習済み大規模言語モデル 評価フレームワーク☆24Oct 20, 2025Updated 3 months ago
- 法律・判例関係のデータセット☆49Jan 8, 2025Updated last year
- https://vaccines-kyoto-city.jp/#faq をHTML化したものです☆17May 31, 2021Updated 4 years ago
- ☆17Sep 23, 2021Updated 4 years ago
- ☆21Jan 11, 2023Updated 3 years ago
- JGLUE: Japanese General Language Understanding Evaluation☆333Mar 31, 2025Updated 10 months ago
- ☆50Apr 10, 2024Updated last year
- ⚡Japanese sentence splitting(日本語文境界判定器), 40–250× faster via a Rust-accelerated Python library with near-perfect API compatibility with …☆63Oct 14, 2025Updated 3 months ago
- ☆26Jul 13, 2023Updated 2 years ago
- ☆43Feb 2, 2024Updated 2 years ago
- 敬語変換タスクにおける評価用データセット☆21Nov 24, 2022Updated 3 years ago
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆23Sep 17, 2025Updated 4 months ago
- Project of llm evaluation to Japanese tasks☆91Feb 4, 2026Updated last week
- A cute little python module for calculating different ranking metrics. Based entirely on the gist from @bwhite: https://gist.github.com/b…☆21Apr 12, 2023Updated 2 years ago
- Japanese LLaMa experiment☆54Dec 27, 2025Updated last month
- 🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.☆21Jun 1, 2025Updated 8 months ago
- Japanese translation of Open Source AI Definition☆26Nov 15, 2024Updated last year
- Benchmark for Japanese document embedding & vector search☆29Mar 12, 2024Updated last year
- DSPy-powered email optimization for startup founders: drop in your 3 best emails, get optimized outreach for new leads☆39Sep 14, 2025Updated 4 months ago