d223302 / A-Closer-Look-To-LLM-EvaluationLinks

Code for EMNLP 2023 findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"

☆18

Alternatives and similar repositories for A-Closer-Look-To-LLM-Evaluation

Users that are interested in A-Closer-Look-To-LLM-Evaluation are comparing it to the libraries listed below

Sorting:

littlehacker26 / Discriminator-Cooperative-Unlikelihood-Prompt-Tuning
The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…
☆26Updated last year
Yushi-Hu / IC-DST
Code base of In-Context Learning for Dialogue State tracking
☆45Updated last year
xieyxclack / factual_coco
The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.
☆16Updated 3 years ago
Adaxry / Post-Instruction
☆21Updated last year
tencent-ailab / Lodoss
☆33Updated 2 years ago
luohongyin / UniLC
Interpretable unified language safety checking with large language models
☆31Updated 2 years ago
BinWang28 / EvalRank-Embedding-Evaluation
ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities
☆35Updated 2 years ago
shh1574 / multi-modal-dialogue-dataset
☆22Updated 3 years ago
Silin159 / PeaCoK
☆33Updated 3 months ago
disi-unibo-nlp / nlg-metricverse
[COLING22] An End-to-End Library for Evaluating Natural Language Generation
☆92Updated last year
yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119Updated 2 years ago
ZurichNLP / multilingual-instruction-tuning
Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
☆25Updated last month
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago
qqaatw / pytorch-realm-orqa
PyTorch reimplementation of REALM and ORQA
☆22Updated 3 years ago
NAR-tutorial / acl2022
☆99Updated 3 years ago
hongshi97 / CAD
Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"
☆30Updated 7 months ago
Shark-NLP / CoNT
[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation
☆153Updated 2 years ago
john-hewitt / backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
☆69Updated 2 years ago
Alsace08 / SumCoT
[ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"
☆54Updated last year
shizhediao / T-DNA
Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…
☆19Updated 2 years ago
violet-zct / fairseq-detect-hallucination
Detect hallucinated tokens for conditional sequence generation.
☆64Updated 3 years ago
FUZHIYI / TACO
Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"
☆33Updated 2 years ago
guanzhchen / PETuning
☆34Updated 2 years ago
yq-wen / overlapping-datasets
☆9Updated 3 years ago
BinWang28 / FacEval
EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization
☆13Updated 4 months ago
xu1998hz / InstructScore_SEScore3
First explanation metric (diagnostic report) for text generation evaluation
☆62Updated 4 months ago
smartyfh / DST-ASSIST
ASSIST: Towards Label Noise-Robust Dialogue State Tracking
☆10Updated 3 years ago
yrf1 / LLM-MassiveMulticultureNormsKnowledge-NCLB
☆17Updated 4 months ago
cooelf / Paper_Writing_Tips
☆12Updated 3 years ago
FreddeFrallan / Non-Residual-Prompting
☆40Updated 2 years ago