mila-iqia / Casande-RL
Casande-RL
☆11Updated 2 years ago
Alternatives and similar repositories for Casande-RL
Users that are interested in Casande-RL are comparing it to the libraries listed below
- ☆33Updated 9 months ago
- Code for the paper "Towards Mitigating LLM Hallucination via Self Reflection"☆28Updated 2 years ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆75Updated this week
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆21Updated last year
- [Nature Communications] The official code for "Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases".☆30Updated 2 months ago
- Implementation of the paper "ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"☆45Updated 5 months ago
- [NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models☆102Updated last year
- [ACL 2024] This is the code for our paper "RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records".☆38Updated last year
- Official Code for the paper "SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs" (ICLR 2024)☆26Updated last year
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making☆200Updated 11 months ago
- Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)☆12Updated last year
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Updated last year
- A paper collection for LLM-based patient simulators☆67Updated last month
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆40Updated last year
- ☆47Updated last year
- ☆52Updated 11 months ago
- Official repository of the MIRAGE benchmark☆177Updated last year
- MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs, NeurIPS 2024☆32Updated 3 months ago
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆141Updated last year
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆41Updated 5 months ago
- A Chinese National Medical Licensing Examination dataset and large language model benchmarks☆77Updated last year
- ☆37Updated 10 months ago
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆10Updated 5 months ago
- ☆20Updated last year
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆64Updated 7 months ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆20Updated 9 months ago
- Code for the EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆45Updated 11 months ago
- ☆12Updated 8 months ago
- A Survey on Medical Report Generation: From Deep Neural Networks to Large Language Models☆29Updated last year