dmis-lab / OLAPH
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
☆40Updated 8 months ago
Alternatives and similar repositories for OLAPH:
Users that are interested in OLAPH are comparing it to the libraries listed below
- [NAACL '25] Rationale-Guided Retrieval Augmented Generation for Medical Question Answering☆23Updated 2 months ago
- ☆90Updated 3 months ago
- Self-verification for LLMs.☆64Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code.☆48Updated 10 months ago
- ☆34Updated 4 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆71Updated 3 weeks ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆93Updated 4 months ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆94Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- Clinical NLP Shared Task @ NAACL'24☆32Updated 2 months ago
- Biomedical Question Answering Datasets.☆105Updated last week
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆104Updated 7 months ago
- Official repository of "EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records" (ACL 2024 Fi…☆16Updated 10 months ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆79Updated last year
- [ISMB '24] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models☆62Updated last year
- ☆48Updated 2 months ago
- ☆43Updated 8 months ago
- [NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records☆79Updated 5 months ago
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.☆50Updated last year
- ☆45Updated last month
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆54Updated 2 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆112Updated last week
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆29Updated 2 months ago
- ☆28Updated 6 months ago
- Retrieval-Augmented Generation battle!☆50Updated 4 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆120Updated 8 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆36Updated 4 months ago
- NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records☆36Updated 4 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆57Updated 3 months ago