nii-nlp / med-evalLinks
Evaluation Pipeline for medical tasks.
☆12Updated last year
Alternatives and similar repositories for med-eval
Users that are interested in med-eval are comparing it to the libraries listed below
Sorting:
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Updated 2 years ago
- Methods and evaluation for aligning language models temporally☆30Updated last year
- Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation."☆18Updated 2 years ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Updated 3 years ago
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"☆15Updated last year
- Complexity Based Prompting for Multi-Step Reasoning☆17Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24Updated 3 years ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆24Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Updated 2 years ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆23Updated 3 years ago
- [NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…☆15Updated 3 years ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated 2 years ago
- Word acquisition in neural language models (TACL 2022).☆20Updated 10 months ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆30Updated 2 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"☆40Updated 2 years ago
- ☆41Updated 2 years ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach☆32Updated 2 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Updated 2 years ago
- ☆22Updated last year
- [NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners☆22Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆33Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆20Updated 2 years ago
- ☆35Updated 4 years ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Updated 2 years ago
- This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering."☆41Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Updated last year
- ☆88Updated 2 years ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆38Updated 2 years ago
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Updated 2 years ago