dqxiu / KAssess
☆14 · Updated last year
Related projects
Alternatives and complementary repositories for KAssess
- Visual and Embodied Concepts evaluation benchmark ☆21 · Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models ☆19 · Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆48 · Updated 4 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism ☆25 · Updated 4 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech" ☆11 · Updated last year
- Methods and evaluation for aligning language models temporally ☆24 · Updated 8 months ago
- BeHonest: Benchmarking Honesty in Large Language Models ☆30 · Updated 3 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency" ☆17 · Updated last month
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆35 · Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆30 · Updated last year
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning". ☆62 · Updated last year
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe… ☆13 · Updated 3 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆36 · Updated 8 months ago
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li… ☆21 · Updated 10 months ago
- Towards Systematic Measurement for Long Text Quality ☆29 · Updated 2 months ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?" ☆27 · Updated last year
- Personality Alignment of Language Models ☆18 · Updated 2 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023) ☆13 · Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆20 · Updated 8 months ago
- GPT as Human ☆18 · Updated 10 months ago