dmis-lab / OLAPH
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
☆38 Updated 5 months ago
Alternatives and similar repositories for OLAPH:
Users who are interested in OLAPH are comparing it to the repositories listed below.
- ☆85 Updated last week
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code. ☆46 Updated 8 months ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records. ☆91 Updated last year
- Official repository of "EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records" (ACL 2024 Fi… ☆14 Updated 7 months ago
- Clinical NLP Shared Task @ NAACL'24 ☆28 Updated last month
- ☆29 Updated last month
- Self-verification for LLMs. ☆63 Updated last year
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records ☆78 Updated last month
- Biomedical Question Answering Datasets. ☆93 Updated last year
- ISMB'24 "Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models" ☆48 Updated 10 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations ☆43 Updated last month
- Code for MedCPT, a model for zero-shot biomedical information retrieval. ☆159 Updated 10 months ago
- ☆27 Updated 3 months ago
- PMC-Patients: A Large-scale Dataset of Patient Summaries and Relations for Benchmarking Retrieval-based Clinical Decision Support Systems… ☆61 Updated last year
- ☆25 Updated last year
- Official implementation of the ACL 2024 paper "Scientific Inspiration Machines Optimized for Novelty" ☆74 Updated 10 months ago
- Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main. ☆21 Updated 2 months ago
- Code and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref… ☆43 Updated last week
- [NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records ☆75 Updated 2 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 4 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆98 Updated 4 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps" ☆118 Updated 6 months ago
- A collection of AWESOME language modeling techniques on tabular data applications. ☆28 Updated 4 months ago
- ☆44 Updated 4 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆53 Updated last month
- Clinical text summarization by adapting large language models ☆133 Updated 6 months ago
- Retrieval-Augmented Generation battle! ☆49 Updated 2 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine ☆77 Updated 4 months ago