Japanese Massive Multitask Language Understanding Benchmark
☆38Oct 7, 2025Updated 6 months ago
Alternatives and similar repositories for JMMLU
Users that are interested in JMMLU are comparing it to the libraries listed below.
- ☆150Mar 30, 2026Updated 2 weeks ago
- ☆33Jul 31, 2024Updated last year
- ☆62Jun 13, 2024Updated last year
- Japanese LLaMa experiment☆54Dec 27, 2025Updated 3 months ago
- ☆50Apr 10, 2024Updated 2 years ago
- Creates a list of the latest LLMs☆22Apr 9, 2026Updated last week
- Swallow project: large language model evaluation scripts☆24Sep 17, 2025Updated 6 months ago
- ☆19May 23, 2024Updated last year
- ☆30Apr 10, 2025Updated last year
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- ☆24Dec 15, 2023Updated 2 years ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆25Mar 27, 2024Updated 2 years ago
- Japanese translation of Open Source AI Definition☆26Nov 15, 2024Updated last year
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆56Sep 22, 2024Updated last year
- ☆44Apr 10, 2025Updated last year
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Nov 21, 2020Updated 5 years ago
- Benchmark for Japanese document embedding & vector search☆29Mar 12, 2024Updated 2 years ago
- ☆11Oct 2, 2024Updated last year
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 8 months ago
- LLM evaluation project for Japanese tasks☆92Feb 4, 2026Updated 2 months ago
- ☆16Mar 4, 2024Updated 2 years ago
- COMET-ATOMIC ja☆31Mar 8, 2024Updated 2 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆89Mar 16, 2026Updated last month
- Japanese / English Bilingual LLM☆28Dec 23, 2025Updated 3 months ago
- Japanese semantic test suite (FraCaS counterpart and extensions)☆13Mar 29, 2026Updated 2 weeks ago
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated last week
- ☆21Jan 11, 2023Updated 3 years ago
- ☆46Mar 30, 2026Updated 2 weeks ago
- n-wise coverage tool for combinatorial testing☆11Sep 7, 2019Updated 6 years ago
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities☆63Mar 13, 2024Updated 2 years ago
- An ambiguous subtitles dataset for visual scene-aware machine translation☆14Oct 17, 2022Updated 3 years ago
- Japanese grammatical error correction tool☆29Jun 22, 2022Updated 3 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated 2 years ago
- JGLUE: Japanese General Language Understanding Evaluation☆339Mar 31, 2025Updated last year
- ☆44Feb 2, 2024Updated 2 years ago
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆21Jul 10, 2023Updated 2 years ago
- JGLUE: Japanese General Language Understanding Evaluation for huggingface datasets☆12Mar 31, 2025Updated last year
- ☆15Nov 20, 2025Updated 4 months ago
- A paper list of multilingual pre-trained models (continually updated).☆24Jun 18, 2024Updated last year