d223302 / A-Closer-Look-To-LLM-Evaluation
Code for EMNLP 2023 Findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"
☆18 Updated last year
Alternatives and similar repositories for A-Closer-Look-To-LLM-Evaluation:
Users who are interested in A-Closer-Look-To-LLM-Evaluation are comparing it to the repositories listed below
- PyTorch reimplementation of REALM and ORQA ☆22 Updated 3 years ago
- Interpretable unified language safety checking with large language models ☆30 Updated last year
- ☆30 Updated 10 months ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities ☆35 Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆30 Updated last year
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr… ☆41 Updated 2 years ago
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling" ☆34 Updated 2 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch. ☆16 Updated 3 years ago
- Resources for the paper "DialSummEval: Revisiting summarization evaluation for dialogues" ☆14 Updated 2 years ago
- ☆34 Updated 2 years ago
- ☆21 Updated last year
- TBC ☆26 Updated 2 years ago
- Code base of In-Context Learning for Dialogue State Tracking ☆45 Updated last year
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models". ☆47 Updated 2 years ago
- ☆32 Updated last week
- First explanation metric (diagnostic report) for text generation evaluation ☆62 Updated 3 weeks ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc… ☆19 Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉 ☆13 Updated last year
- The code implementation of the EMNLP 2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene… ☆26 Updated last year
- ☆43 Updated last year
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding" ☆28 Updated 4 months ago
- Code for the NeurIPS 2022 Spotlight paper "Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation" ☆20 Updated 2 years ago
- Continual Learning for Task-Oriented Dialogue Systems ☆29 Updated 2 years ago
- Supervised Contrastive Learning for Downstream Optimized Sequence Representations ☆27 Updated 3 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21) ☆13 Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆55 Updated 9 months ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization ☆12 Updated last week
- PyTorch implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks ☆63 Updated 3 years ago
- Code for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022) ☆31 Updated 2 years ago
- ☆25 Updated 2 years ago