d223302 / A-Closer-Look-To-LLM-Evaluation
Code for the EMNLP 2023 Findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"
☆16 Updated 11 months ago
Related projects:
- PyTorch reimplementation of REALM and ORQA ☆22 Updated 2 years ago
- Code base of In-Context Learning for Dialogue State tracking ☆43 Updated 11 months ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities ☆36 Updated last year
- ☆34 Updated last year
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch. ☆16 Updated 2 years ago
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models". ☆45 Updated 2 years ago
- ☆10 Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆29 Updated 11 months ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆52 Updated 3 months ago
- ☆28 Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆16 Updated last month
- Interpretable unified language safety checking with large language models ☆30 Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆38 Updated 2 months ago
- The original Backpack Language Model implementation, a fork of FlashAttention ☆63 Updated last year
- ☆26 Updated 8 months ago
- ☆24 Updated 6 months ago
- TBC ☆26 Updated last year
- The code for the paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202… ☆39 Updated last year
- ☆44 Updated last month
- ☆95 Updated 2 years ago
- The git repository of Modular Prompted Chatbot paper ☆33 Updated last year
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆118 Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction-following ability of information retrieval models. Our foc… ☆28 Updated 3 months ago
- ☆20 Updated last year
- Mutual Information Predicts Hallucinations in Abstractive Summarization ☆11 Updated last year
- ☆21 Updated last year
- PyTorch implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks ☆62 Updated 2 years ago
- ☆24 Updated 4 months ago
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization" ☆16 Updated 2 years ago
- DEMix Layers for Modular Language Modeling ☆51 Updated 3 years ago