dmis-lab / OLAPH
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
☆39Updated 7 months ago
Alternatives and similar repositories for OLAPH:
Users that are interested in OLAPH are comparing it to the libraries listed below
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆90Updated 3 months ago
- ☆89Updated 2 months ago
- Self-verification for LLMs.☆64Updated last year
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code.☆48Updated 10 months ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆94Updated last year
- Clinical NLP Shared Task @ NAACL'24☆31Updated last month
- ☆34Updated 3 months ago
- ☆48Updated last month
- Official repository of "EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records" (ACL 2024 Fi…☆16Updated 9 months ago
- [NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records☆78Updated 4 months ago
- PMC-Patients: A Large-scale Dataset of Patient Summaries and Relations for Benchmarking Retrieval-based Clinical Decision Support Systems…☆65Updated last year
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.☆49Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆104Updated 6 months ago
- NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records☆32Updated 3 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆120Updated 8 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆51Updated 3 weeks ago
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆105Updated 7 months ago
- ☆49Updated last year
- Biomedical Question Answering Datasets.☆101Updated last year
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆118Updated 7 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref …☆47Updated last month
- [NAACL '25] Rationale-Guided Retrieval Augmented Generation for Medical Question Answering☆18Updated last month
- Official repository of the MIRAGE benchmark☆132Updated 5 months ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆38Updated 9 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 2 months ago
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆63Updated 2 months ago
- Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records☆20Updated 7 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- ☆50Updated last year
- Medical Hallucination in Foundation Models and Their Impact on Healthcare (2025)☆48Updated last month