☆73Jan 28, 2026Updated last month
Alternatives and similar repositories for aci-bench
Users who are interested in aci-bench are comparing it to the repositories listed below
- A new collection of 1.7k doctor-patient conversations and corresponding clinical notes/summaries.☆115May 31, 2025Updated 9 months ago
- A corpus of textual data corresponding to synthetic clinical encounters, including each encounter’s dialogue transcript and clinical note…☆42Sep 27, 2023Updated 2 years ago
- Repository for the paper 'Enhancing Clinical Decision Support with Physiological Waveforms — A Multimodal Benchmark in Emergency Care'.☆22Apr 30, 2025Updated 10 months ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆21May 18, 2024Updated last year
- A central repository for curating and managing diverse datasets used in healthcare applications.☆11Jun 8, 2024Updated last year
- Dataset of 57 mock medical primary care consultations: audio, consultation notes, human utterance-level transcripts.☆71Nov 16, 2022Updated 3 years ago
- [NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage☆16Sep 2, 2025Updated 6 months ago
- Dataset and Evaluation Code for the K-QA Benchmark.☆18May 26, 2024Updated last year
- ☆67Jul 19, 2022Updated 3 years ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆98May 17, 2025Updated 9 months ago
- ☆18Oct 13, 2022Updated 3 years ago
- Source codes of the paper "Hierarchical Pretraining on Multimodal Electronic Health Records".☆20Apr 10, 2024Updated last year
- Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records☆26Aug 21, 2024Updated last year
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"☆31Jun 18, 2025Updated 8 months ago
- CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering☆23Feb 26, 2021Updated 5 years ago
- ☆25Jan 15, 2024Updated 2 years ago
- ☆14Aug 29, 2025Updated 6 months ago
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆36Sep 18, 2025Updated 5 months ago
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).☆80Apr 12, 2024Updated last year
- ☆27Dec 12, 2024Updated last year
- ☆30Oct 13, 2023Updated 2 years ago
- Expert-Curated Oncology Reports to Advance Language Model Inference☆33Apr 17, 2024Updated last year
- This repository aims to reproduce R1-Zero in the medical domain.☆32Jun 11, 2025Updated 8 months ago
- The repository for "MedChain: Bridging the Gap Between LLM Agents and Real-World Clinical Decision Making"☆44Oct 10, 2025Updated 4 months ago
- KAIST AI605 Deep Learning for NLP☆31Jun 6, 2022Updated 3 years ago
- This project implements a RAG (Retrieval-Augmented Generation) system using an open-source stack. It utilizes BioMistral 7B as the main m…☆37Feb 22, 2024Updated 2 years ago
- [NeurIPS 2024 D&B] Official code for "EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries"☆41Jan 11, 2025Updated last year
- Some resources (books, papers, videos, and online courses) about ML, DL, DM☆12Mar 14, 2021Updated 4 years ago
- ☆31Nov 24, 2022Updated 3 years ago
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆45Oct 18, 2025Updated 4 months ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆41Jun 23, 2024Updated last year
- Code for modular summarization work published in ACL 2021 by Krishna et al.☆30Nov 4, 2021Updated 4 years ago
- [ACL 2024] This is the code for our paper "RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records".☆41Sep 19, 2024Updated last year
- Clinical NLP Shared Task @ NAACL'24☆42Aug 20, 2025Updated 6 months ago
- Deep Learning for EHR papers☆34Aug 25, 2019Updated 6 years ago
- ☆11May 18, 2022Updated 3 years ago
- SMART on FHIR vue implementation of the clear blue button health record reference☆10Jan 14, 2021Updated 5 years ago
- Detect-Then-Explain Framework for Text-to-SQL task☆10Dec 6, 2023Updated 2 years ago
- A simple repository showcasing a few LLM Evaluation strategies and leverages W&B Sweeps to optimize the LLM system.☆12Jul 11, 2023Updated 2 years ago