MIRAGE is a light benchmark to evaluate RAG performance.
☆37May 18, 2025Updated 11 months ago
Alternatives and similar repositories for MIRAGE
Users that are interested in MIRAGE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Feb 27, 2024Updated 2 years ago
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆38Dec 30, 2025Updated 4 months ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆213Apr 14, 2026Updated 3 weeks ago
- ☆19Mar 4, 2024Updated 2 years ago
- ☆39Aug 20, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Apr 6, 2023Updated 3 years ago
- Papers of Implicit Reasoning in LLMs.☆24Mar 13, 2025Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆43Mar 31, 2025Updated last year
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆17Feb 13, 2026Updated 2 months ago
- AutoRAG example about benchmarking Korean embeddings.☆44Oct 2, 2024Updated last year
- [ACL 2024] REANO: Optimising Retrieval-Augmented Reader Models through Knowledge Graph Generation☆12Sep 4, 2024Updated last year
- Project template for STAT-4830☆19Feb 16, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- ☆19Jun 3, 2024Updated last year
- It is about how to load and aggregate pretrained word embeddings in pytorch, e.g., ELMo\BERT\XLNET.☆12Mar 2, 2020Updated 6 years ago
- ☆10Nov 12, 2024Updated last year
- Code for Open-ended Knowledge Tracing☆24Oct 18, 2023Updated 2 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago
- ☆28Feb 11, 2026Updated 2 months ago
- Deep Generative Models course, 2021☆22Dec 25, 2021Updated 4 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Benchmarking library for RAG☆268Mar 11, 2026Updated last month
- Deep Generative Models course, 2025☆10Jun 5, 2025Updated 11 months ago
- Named-entity datasets and GloVe models for the Armenian language☆11Oct 23, 2018Updated 7 years ago
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist☆16Aug 21, 2025Updated 8 months ago
- My solutions of python shad course☆14Oct 20, 2024Updated last year
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 6 months ago
- Released Code for ACL 21 paper: DocOIE A Document-level Context-Aware Dataset for OpenIE☆15Nov 25, 2022Updated 3 years ago
- Generating Sentences from Disentangled Syntactic and Semantic Spaces☆11Jun 24, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆22Jan 13, 2025Updated last year
- bb25 is a fast, self-contained BM25 + Bayesian calibration implementation with a minimal Python API.☆144Mar 17, 2026Updated last month
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- Code and data for Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction (ECML-PKDD 22)☆15Sep 6, 2022Updated 3 years ago
- ☆22Nov 24, 2022Updated 3 years ago
- Adding random noise to a text dataset, and controlling very accurately the quality of the result☆20Apr 13, 2026Updated 3 weeks ago