d223302 / A-Closer-Look-To-LLM-Evaluation
Code for EMNLP 2023 Findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"
☆18 Updated last year
Alternatives and similar repositories for A-Closer-Look-To-LLM-Evaluation
Users who are interested in A-Closer-Look-To-LLM-Evaluation are comparing it to the libraries listed below
- The code implementation of the EMNLP 2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene… ☆26 Updated last year
- Code base of In-Context Learning for Dialogue State tracking ☆45 Updated last year
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch. ☆16 Updated 3 years ago
- ☆21 Updated last year
- ☆33 Updated 2 years ago
- Interpretable unified language safety checking with large language models ☆31 Updated 2 years ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities ☆35 Updated 2 years ago
- ☆22 Updated 3 years ago
- ☆33 Updated 3 months ago
- [COLING22] An End-to-End Library for Evaluating Natural Language Generation ☆92 Updated last year
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆119 Updated 2 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" ☆25 Updated last month
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022) ☆102 Updated 2 years ago
- PyTorch reimplementation of REALM and ORQA ☆22 Updated 3 years ago
- ☆99 Updated 3 years ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding" ☆30 Updated 7 months ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation ☆153 Updated 2 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention ☆69 Updated 2 years ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)" ☆54 Updated last year
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc… ☆19 Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation. ☆64 Updated 3 years ago
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling" ☆33 Updated 2 years ago
- ☆34 Updated 2 years ago
- ☆9 Updated 3 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization ☆13 Updated 4 months ago
- First explanation metric (diagnostic report) for text generation evaluation ☆62 Updated 4 months ago
- ASSIST: Towards Label Noise-Robust Dialogue State Tracking ☆10 Updated 3 years ago
- ☆17 Updated 4 months ago
- ☆12 Updated 3 years ago
- ☆40 Updated 2 years ago