日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark
☆40Oct 7, 2025Updated 8 months ago
Alternatives and similar repositories for JMMLU
Users that are interested in JMMLU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆160Jun 3, 2026Updated last week
- ☆33Jul 31, 2024Updated last year
- ☆62Jun 13, 2024Updated 2 years ago
- Preferred Generation Benchmark☆96Mar 6, 2026Updated 3 months ago
- Japanese LLaMa experiment☆54Dec 27, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆50Apr 10, 2024Updated 2 years ago
- 最新LLMの一覧を作成します☆22Apr 27, 2026Updated last month
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 8 months ago
- ☆19Apr 21, 2026Updated last month
- ☆30Apr 10, 2025Updated last year
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- ☆24Dec 15, 2023Updated 2 years ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆25Mar 27, 2024Updated 2 years ago
- Japanese translation of Open Source AI Definition☆26Nov 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Trials of pre-trained BERT models for the medical domain in Japanese.☆13Nov 21, 2020Updated 5 years ago
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆58Sep 22, 2024Updated last year
- ☆44Apr 10, 2025Updated last year
- Benchmark for Japanese document embedding & vector search☆29Mar 12, 2024Updated 2 years ago
- ☆11Oct 2, 2024Updated last year
- Project of llm evaluation to Japanese tasks☆94May 13, 2026Updated last month
- COMET-ATOMIC ja☆31Mar 8, 2024Updated 2 years ago
- ☆16Mar 4, 2024Updated 2 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆90Mar 16, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Japanese / English Bilingual LLM☆30Dec 23, 2025Updated 5 months ago
- Japanese semantic test suite (FraCaS counterpart and extensions)☆13Apr 21, 2026Updated last month
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated 2 months ago
- ☆22Jan 11, 2023Updated 3 years ago
- n-wise coverage tool for combinatorial testing☆11Sep 7, 2019Updated 6 years ago
- ☆47Mar 30, 2026Updated 2 months ago
- An ambiguous subtitles dataset for visual scene-aware machine translation☆14Oct 17, 2022Updated 3 years ago
- 日本語文法誤り訂正ツール☆29Jun 22, 2022Updated 3 years ago
- 2023 ABCI Llama-2 継続学習プロジェクト☆14Jan 22, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated 2 years ago
- JGLUE: Japanese General Language Understanding Evaluation☆344Mar 31, 2025Updated last year
- ☆47Feb 2, 2024Updated 2 years ago
- JGLUE: Japanese General Language Understanding Evaluation for huggingface datasets☆13Mar 31, 2025Updated last year
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆21Jul 10, 2023Updated 2 years ago
- ☆15Nov 20, 2025Updated 6 months ago
- NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la…☆27Nov 29, 2024Updated last year