nlp-waseda/JMMLU

Japanese Massive Multitask Language Understanding Benchmark

☆40

Alternatives and similar repositories for JMMLU

Users interested in JMMLU often compare it with the repositories listed below.

llm-jp / llm-jp-eval
☆165 · Jul 19, 2026 · Updated last week
ku-nlp / ja-vicuna-qa-benchmark
☆33 · Jul 31, 2024 · Updated last year
llm-jp / llm-jp-sft
☆62 · Jun 13, 2024 · Updated 2 years ago
pfnet-research / pfgen-bench
Preferred Generation Benchmark
☆102 · Mar 6, 2026 · Updated 4 months ago
lighttransport / japanese-llama-experiment
Japanese LLaMa experiment
☆54 · Dec 27, 2025 · Updated 6 months ago
yuzu-ai / japanese-llm-ranking
☆50 · Apr 10, 2024 · Updated 2 years ago
swallow-llm / swallow-evaluation
Evaluation scripts for large language models from the Swallow project
☆25 · Sep 17, 2025 · Updated 10 months ago
osekilab / JCoLA
☆19 · Apr 21, 2026 · Updated 3 months ago
nlp-waseda / traveling-across-languages
Official repo and evaluation implementation of KnowRecall and VisRecall
☆10 · May 22, 2025 · Updated last year
aiishii / JEMHopQA
☆30 · Apr 10, 2025 · Updated last year
stardust-coder / awesome-latest-LLM
A list of the latest LLMs
☆23 · Jun 20, 2026 · Updated last month
ku-nlp / AnnotatedFKCCorpus
Annotated Fuman Kaitori Center Corpus
☆18 · Dec 18, 2023 · Updated 2 years ago
kunishou / do-not-answer-ja
☆24 · Dec 15, 2023 · Updated 2 years ago
nu-dialogue / jmultiwoz
JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024
☆25 · Mar 27, 2024 · Updated 2 years ago
sociocom / JMED-LLM
JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models
☆59 · Sep 22, 2024 · Updated last year
ou-medinfo / medbertjp
Trials of pre-trained BERT models for the medical domain in Japanese.
☆13 · Nov 21, 2020 · Updated 5 years ago
mizuumi / JDocQA
☆44 · Apr 10, 2025 · Updated last year
nouu-me / document_vector_search_benchmark
Benchmark for Japanese document embedding & vector search
☆29 · Mar 12, 2024 · Updated 2 years ago
stardust-coder / japanese-lm-med-harness
☆11 · Oct 2, 2024 · Updated last year
shisa-ai / shaberi
Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda
☆19 · Apr 29, 2026 · Updated 2 months ago
SAP / software-documentation-data-set-for-machine-translation
A parallel evaluation data set of SAP software documentation with document structure annotation
☆15 · Jun 12, 2026 · Updated last month
DaisukeBekki / JSeM
Japanese semantic test suite (FraCaS counterpart and extensions)
☆13 · Apr 21, 2026 · Updated 3 months ago
wandb / llm-leaderboard
LLM evaluation project for Japanese tasks
☆94 · Jul 15, 2026 · Updated last week
sbintuitions / JMTEB
The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
☆93 · Mar 16, 2026 · Updated 4 months ago
nlp-waseda / comet-atomic-ja
COMET-ATOMIC ja
☆31 · Mar 8, 2024 · Updated 2 years ago
KanHatakeyama / JapaneseWarcParser
☆16 · Mar 4, 2024 · Updated 2 years ago
ku-nlp / text-cleaning
A powerful text cleaner for Japanese web texts
☆12 · Jan 20, 2024 · Updated 2 years ago
rioyokotalab / Megatron-Llama2
2023 ABCI Llama-2 continual learning project
☆14 · Jan 22, 2024 · Updated 2 years ago
inspection-ai / japanese-toxic-dataset
☆22 · Jan 11, 2023 · Updated 3 years ago
nu-dialogue / real-persona-chat
RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities
☆66 · Mar 13, 2024 · Updated 2 years ago
youichiro / transformer-copy
Japanese grammatical error correction tool
☆29 · Jun 22, 2022 · Updated 4 years ago
ku-nlp / VISA
An ambiguous subtitles dataset for visual scene-aware machine translation
☆14 · Oct 17, 2022 · Updated 3 years ago
llm-jp / llm-jp-tokenizer
☆48 · Mar 30, 2026 · Updated 3 months ago
shisa-ai / shisa-v2
Japanese / English Bilingual LLM
☆34 · Dec 23, 2025 · Updated 7 months ago
nii-nlp / med-eval
Evaluation Pipeline for medical tasks.
☆12 · Apr 8, 2026 · Updated 3 months ago
kaiyuhwang / MLLM-Survey
A paper list on multilingual pre-trained models (continually updated).
☆25 · Jun 18, 2024 · Updated 2 years ago
singletongue / wikipedia-utils
Utility scripts for preprocessing Wikipedia texts for NLP
☆78 · Apr 9, 2024 · Updated 2 years ago
jumon / himitsu
An official implementation of the paper "Addressing Segmentation Ambiguity in Neural Linguistic Steganography"
☆14 · Nov 12, 2022 · Updated 3 years ago
project-miracl / nomiracl
NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la…
☆27 · Nov 29, 2024 · Updated last year