lifan-yuan / PLMCalibration
Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"
☆12 Updated last year
Related projects:
- Methods and evaluation for aligning language models temporally ☆24 Updated 6 months ago
- ☆23 Updated last year
- ☆39 Updated 9 months ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆38 Updated last year
- ☆49 Updated last year
- ☆42 Updated 7 months ago
- The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models". ☆19 Updated last year
- ☆36 Updated 5 months ago
- ☆32 Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆20 Updated 6 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆51 Updated last year
- Constrained Decoding Project ☆17 Updated 10 months ago
- ☆44 Updated 2 weeks ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases. ☆22 Updated 2 weeks ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" ☆54 Updated 8 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 Updated 9 months ago
- Active Example Selection for In-Context Learning (EMNLP'22) ☆43 Updated last month
- ☆70 Updated 10 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge ☆10 Updated 7 months ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022. ☆19 Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆61 Updated last year
- ☆77 Updated last year
- ☆18 Updated last year
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering. ☆17 Updated last year
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua… ☆26 Updated last year
- ☆32 Updated 5 months ago
- Restore safety in fine-tuned language models through task arithmetic ☆25 Updated 5 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models ☆19 Updated last year
- ☆23 Updated last year
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification ☆26 Updated 2 years ago