lifan-yuan / PLMCalibrationLinks
Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"
☆11Updated 2 years ago
Alternatives and similar repositories for PLMCalibration
Users that are interested in PLMCalibration are comparing it to the libraries listed below
Sorting:
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆62Updated last year
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Updated last year
- ☆177Updated last year
- ☆88Updated 3 years ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆118Updated last year
- ☆44Updated last year
- ☆79Updated 2 years ago
- ☆75Updated 2 years ago
- Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Const…☆65Updated last year
- Analyzing LLM Alignment via Token distribution shift☆17Updated last year
- Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…☆21Updated 2 years ago
- Monitoring the health of ARR☆27Updated 2 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆65Updated 2 years ago
- ☆64Updated 3 years ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆55Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆132Updated 2 years ago
- ☆55Updated last year
- Constrained Decoding Project☆20Updated 2 years ago
- ☆29Updated last year
- ☆25Updated 6 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆70Updated 3 years ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆64Updated last year
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆66Updated 2 years ago
- ☆32Updated 3 years ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆16Updated 2 years ago
- ☆57Updated 7 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Updated 2 years ago
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification☆27Updated 3 years ago
- ☆28Updated last year