RulinShao / RAG-evaluation-harnessesView external linksLinks
An evaluation suite for Retrieval-Augmented Generation (RAG).
☆23Apr 26, 2025Updated 9 months ago
Alternatives and similar repositories for RAG-evaluation-harnesses
Users that are interested in RAG-evaluation-harnesses are comparing it to the libraries listed below
Sorting:
- ☆19Sep 16, 2025Updated 4 months ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- ☆11Jan 3, 2024Updated 2 years ago
- [LREC-Coling 2024] PECC: Problem Extraction and Coding Challenges☆14May 30, 2024Updated last year
- Kernel Herding for probability density estimation☆14Feb 23, 2016Updated 9 years ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 6 months ago
- Levin tree search guided by both a policy and a heuristic function☆19Jul 13, 2023Updated 2 years ago
- ☆14Apr 21, 2023Updated 2 years ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- This is the source code for: Context-aware Entity Typing in Knowledge Graphs.☆16May 10, 2022Updated 3 years ago
- Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)☆15Apr 18, 2024Updated last year
- ☆21Aug 19, 2024Updated last year
- ☆18May 5, 2021Updated 4 years ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆24Aug 25, 2025Updated 5 months ago
- ☆27Jul 11, 2024Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- ☆26May 29, 2022Updated 3 years ago
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆28Mar 26, 2024Updated last year
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆24Nov 18, 2023Updated 2 years ago
- [WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation".☆36Dec 22, 2024Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated 11 months ago
- MICCAI 2024 code for the paper: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing. EchoNet-Synthetic i…☆36Jun 16, 2025Updated 7 months ago
- The Pre-lease github repository of ECHOPULSE: ECG CONTROLLED ECHOCARDIO- GRAMS VIDEO GENERATION☆42Feb 4, 2025Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆51May 12, 2025Updated 9 months ago
- ☆17Oct 30, 2025Updated 3 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Dec 19, 2023Updated 2 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- Code for our CIKM'21 paper "Complex Temporal Qestion Answering on Knowledge Graphs"☆31Jan 13, 2024Updated 2 years ago
- exploring whether LLMs perform case-based or rule-based reasoning☆30Mar 2, 2024Updated last year
- [ICLR 2024 Oral] Improving Convergence and Generalization Using Parameter Symmetries☆31May 29, 2024Updated last year
- SciAssess is a comprehensive benchmark for evaluating Large Language Models' proficiency in scientific literature analysis across various…☆83May 21, 2025Updated 8 months ago
- ☆31Mar 24, 2023Updated 2 years ago
- Face Recognition on NVIDIA TX2☆10Sep 5, 2018Updated 7 years ago
- Concurrency library☆16Oct 13, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year