project-miracl / nomiraclView external linksLinks
NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 languages.
☆26Nov 29, 2024Updated last year
Alternatives and similar repositories for nomiracl
Users that are interested in nomiracl are comparing it to the libraries listed below
Sorting:
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆18Jan 13, 2025Updated last year
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda☆18Jan 6, 2026Updated last month
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 2 years ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆38Oct 7, 2025Updated 4 months ago
- The paper list of multilingual pre-trained models (Continual Updated).☆24Jun 18, 2024Updated last year
- ☆20Mar 22, 2024Updated last year
- ☆19May 23, 2024Updated last year
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆23Sep 17, 2025Updated 4 months ago
- ☆30Jun 3, 2024Updated last year
- Benchmark for Japanese document embedding & vector search☆29Mar 12, 2024Updated last year
- ☆28Oct 31, 2023Updated 2 years ago
- COMET-ATOMIC ja☆31Mar 8, 2024Updated last year
- A lightweight framework for evaluating visual-language models.☆41Jan 16, 2026Updated 3 weeks ago
- Pequenos projetos e testes simples em linguagem Python.☆11Jan 28, 2018Updated 8 years ago
- JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット☆42Sep 9, 2025Updated 5 months ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- web programming course (COMPSCI 326, UMass Amherst)☆14Sep 13, 2022Updated 3 years ago
- Stochastic Kronecker Generation in Python, Used in RPI TRUST☆10Dec 13, 2017Updated 8 years ago
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated 2 weeks ago
- EANN(Pytorch)☆10Mar 12, 2022Updated 3 years ago
- ☆13May 11, 2021Updated 4 years ago
- Evaluation Pipeline for medical tasks.☆12Updated this week
- ICHEC Quantum natural language processing (QNLP) toolkit☆41Oct 1, 2020Updated 5 years ago
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆11Aug 1, 2023Updated 2 years ago
- Ultimate playbook for unmoderated UX testing☆13Jan 27, 2025Updated last year
- Background materials for the article "Productivity Assessment of Neural Code Completion"☆13Jul 11, 2023Updated 2 years ago
- Deep Boltzmann Machines in R^N dimensions☆11May 14, 2014Updated 11 years ago
- ☆10Jun 16, 2021Updated 4 years ago
- ☆12Mar 1, 2025Updated 11 months ago
- code and dataset of EMNLP 2020 paper "PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge"☆11Nov 6, 2020Updated 5 years ago
- ディープラーニングによる自然言語処理(共立出版)のサ ポートページです☆10May 7, 2023Updated 2 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)☆12Oct 18, 2022Updated 3 years ago
- ☆10Sep 14, 2022Updated 3 years ago
- Regex base tail written in Rust☆10Mar 20, 2023Updated 2 years ago
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Jan 29, 2026Updated 2 weeks ago
- ☆12Feb 27, 2022Updated 3 years ago
- A shareable Renovate config for Cybozu☆11Updated this week
- Amazon S3 CLI Tool by using promptui☆12Jun 18, 2025Updated 7 months ago
- ☆10Sep 13, 2022Updated 3 years ago