yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆24Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm-temporal-alignment
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆20Updated 8 months ago
- ☆24Updated last year
- ☆40Updated 11 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆47Updated 4 months ago
- ☆38Updated 7 months ago
- ☆59Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆59Updated 8 months ago
- ☆33Updated 2 years ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆25Updated 4 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆38Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆26Updated 7 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆30Updated 3 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- ☆46Updated last month
- ☆25Updated last year
- ☆80Updated 2 years ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆65Updated 2 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆22Updated 2 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆50Updated 7 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆18Updated last year
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆24Updated last year
- ☆24Updated 6 months ago
- GPT as Human☆18Updated 10 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆35Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆36Updated 8 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆61Updated last year
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆38Updated 2 years ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆27Updated last year