swallow-llm/swallow-evaluation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/swallow-llm/swallow-evaluation)

swallow-llm / swallow-evaluation

Swallowプロジェクト大規模言語モデル評価スクリプト

☆24

Alternatives and similar repositories for swallow-evaluation

Users that are interested in swallow-evaluation are comparing it to the libraries listed below

Sorting:

swallow-llm / swallow-evaluation-instruct
View on GitHub
Swallowプロジェクト事後学習済み大規模言語モデル評価フレームワーク
☆26Oct 20, 2025Updated 4 months ago
nlp-waseda / JMMLU
View on GitHub
日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark
☆38Oct 7, 2025Updated 4 months ago
KanHatakeyama / JapaneseWarcParser
View on GitHub
☆16Mar 4, 2024Updated 2 years ago
llm-jp / llm-jp-sft
View on GitHub
☆62Jun 13, 2024Updated last year
pfnet-research / pfgen-bench
View on GitHub
Preferred Generation Benchmark
☆92Oct 28, 2025Updated 4 months ago
llm-jp / llm-jp-tokenizer
View on GitHub
☆46Sep 6, 2025Updated 5 months ago
Aratako / Japanese-RP-Bench
View on GitHub
☆18Sep 29, 2024Updated last year
wandb / llm-leaderboard
View on GitHub
Project of llm evaluation to Japanese tasks
☆92Feb 4, 2026Updated last month
matsuolab / ucllm_nedo_prod
View on GitHub
☆57Jun 17, 2024Updated last year
llm-jp / llm-jp-eval
View on GitHub
☆149Updated this week
llm-jp / llm-jp-eval-mm
View on GitHub
A lightweight framework for evaluating visual-language models.
☆41Jan 16, 2026Updated last month
sociocom / JMED-LLM
View on GitHub
JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models
☆56Sep 22, 2024Updated last year
Ino-Ichan / GIT-LLM
View on GitHub
☆22Sep 18, 2023Updated 2 years ago
pfnet-research / plamo-examples
View on GitHub
☆25May 29, 2025Updated 9 months ago
python-nlp-book / python-nlp-book
View on GitHub
ディープラーニングによる自然言語処理（共立出版）のサポートページです
☆10May 7, 2023Updated 2 years ago
Language-Media-Lab / commonsense-moral-ja
View on GitHub
☆15Nov 20, 2025Updated 3 months ago
okoge-kaz / llm-recipes
View on GitHub
Ongoing Research Project for continaual pre-training LLM(dense mode)
☆44Mar 3, 2025Updated last year
azooKey / AJIMEE-Bench
View on GitHub
AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)
☆18Jan 13, 2025Updated last year
nu-dialogue / jmultiwoz
View on GitHub
JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024
☆25Mar 27, 2024Updated last year
hppRC / simple-simcse-ja
View on GitHub
Exploring Japanese SimCSE
☆69Oct 31, 2023Updated 2 years ago
kunishou / do-not-answer-ja
View on GitHub
☆24Dec 15, 2023Updated 2 years ago
speed1313 / jax-llm
View on GitHub
JAX implementation of Large Language Models. You can train GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dat…
☆13Aug 5, 2024Updated last year
SatoruMuro / SAM2GUIfor3Drecon
View on GitHub
SegRef3D: AI-Powered Segmentation and Interactive Refinement for Labor-Saving 3D Reconstruction
☆16Feb 9, 2026Updated 3 weeks ago
rioyokotalab / Megatron-Llama2
View on GitHub
2023 ABCI Llama-2 継続学習プロジェクト
☆14Jan 22, 2024Updated 2 years ago
offtoung / ez-chat-llm
View on GitHub
Webブラウザから手軽にローカルLLMとおしゃべりできるソフトウェアです。
☆27Dec 28, 2023Updated 2 years ago
lighttransport / japanese-llama-experiment
View on GitHub
Japanese LLaMa experiment
☆54Dec 27, 2025Updated 2 months ago
shisa-ai / shaberi
View on GitHub
Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda
☆18Jan 6, 2026Updated last month
mariomeissner / lightning-hydra-transformers
View on GitHub
My take on how you should organize your transformer experiments.
☆13Apr 13, 2022Updated 3 years ago
lighttransport / jagger-python
View on GitHub
Python binding for Jagger(C++ implementation of Pattern-based Japanese Morphological Analyzer)
☆12Dec 16, 2025Updated 2 months ago
hppRC / bert-classification-tutorial-2024
View on GitHub
【2024年版】BERTによるテキスト分類
☆30Jul 8, 2024Updated last year
para-lost / RVP
View on GitHub
Recursive Visual Programming (ECCV 2024)
☆18Nov 20, 2024Updated last year
hitachi-nlp / FLD-corpus
View on GitHub
☆19Dec 6, 2024Updated last year
nu-dialogue / real-persona-chat
View on GitHub
RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities
☆63Mar 13, 2024Updated last year
Solafune-Inc / solafune-tools
View on GitHub
Open tools for Solafune developers and Solafune hackers where can share developed tools in geospatial data.
☆41Feb 23, 2026Updated last week
tosiyuki / LLaVA-JP
View on GitHub
LLaVA-JP is a Japanese VLM trained by LLaVA method
☆64Jul 3, 2024Updated last year
okoge-kaz / moe-recipes
View on GitHub
Ongoing research training Mixture of Expert models.
☆21Sep 16, 2024Updated last year
Aratako / Task-Vector-Merge-Optimzier
View on GitHub
☆17Apr 11, 2024Updated last year
mkobayashime / fest2019-web
View on GitHub
Website for 73rd Nada School Festival: 第73回灘校文化祭
☆13Apr 20, 2023Updated 2 years ago
nobu-g / JGLUE-evaluation-scripts
View on GitHub
Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark
☆18Updated this week