日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark
☆39Oct 7, 2025Updated 6 months ago
Alternatives and similar repositories for JMMLU
Users that are interested in JMMLU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆152Apr 28, 2026Updated last week
- ☆33Jul 31, 2024Updated last year
- ☆62Jun 13, 2024Updated last year
- Preferred Generation Benchmark☆94Mar 6, 2026Updated 2 months ago
- Japanese LLaMa experiment☆54Dec 27, 2025Updated 4 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆50Apr 10, 2024Updated 2 years ago
- 最新LLMの一覧を作成します☆22Apr 27, 2026Updated last week
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 7 months ago
- ☆19Apr 21, 2026Updated 2 weeks ago
- Official repo and evaluation implementation of KnowRecall and VisRecall☆10May 22, 2025Updated 11 months ago
- ☆30Apr 10, 2025Updated last year
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- ☆24Dec 15, 2023Updated 2 years ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆25Mar 27, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Japanese translation of Open Source AI Definition☆26Nov 15, 2024Updated last year
- JMED-LLM: Japanese Medical Evaluation Dataset for Large Language Models☆57Sep 22, 2024Updated last year
- ☆44Apr 10, 2025Updated last year
- Trials of pre-trained BERT models for the medical domain in Japanese.☆13Nov 21, 2020Updated 5 years ago
- Benchmark for Japanese document embedding & vector search☆29Mar 12, 2024Updated 2 years ago
- ☆11Oct 2, 2024Updated last year
- A parallel evaluation data set of SAP software documentation with document structure annotation☆15Jul 30, 2025Updated 9 months ago
- Project of llm evaluation to Japanese tasks☆93Apr 28, 2026Updated last week
- A powerful text cleaner for Japanese web texts☆12Jan 20, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆16Mar 4, 2024Updated 2 years ago
- COMET-ATOMIC ja☆31Mar 8, 2024Updated 2 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆89Mar 16, 2026Updated last month
- Japanese / English Bilingual LLM☆29Dec 23, 2025Updated 4 months ago
- Japanese semantic test suite (FraCaS counterpart and extensions)☆13Apr 21, 2026Updated 2 weeks ago
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated 3 weeks ago
- ☆21Jan 11, 2023Updated 3 years ago
- ☆47Mar 30, 2026Updated last month
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities☆64Mar 13, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An ambiguous subtitles dataset for visual scene-aware machine translation☆14Oct 17, 2022Updated 3 years ago
- 日本語文法誤り訂正ツール☆29Jun 22, 2022Updated 3 years ago
- 2023 ABCI Llama-2 継続学習プロジェクト☆14Jan 22, 2024Updated 2 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated 2 years ago
- JGLUE: Japanese General Language Understanding Evaluation☆339Mar 31, 2025Updated last year
- ☆45Feb 2, 2024Updated 2 years ago
- JGLUE: Japanese General Language Understanding Evaluation for huggingface datasets☆13Mar 31, 2025Updated last year