kstats / CIMA
☆21Updated 3 years ago
Alternatives and similar repositories for CIMA:
Users that are interested in CIMA are comparing it to the libraries listed below
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Updated 2 years ago
- Dense hybrid representations for text retrieval☆62Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 6 months ago
- FRANK: Factuality Evaluation Benchmark☆52Updated 2 years ago
- ☆47Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆99Updated 10 months ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆78Updated last year
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆101Updated 4 years ago
- ☆97Updated 2 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆14Updated 3 months ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆75Updated 3 months ago
- ☆44Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆100Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 11 months ago
- ☆84Updated 5 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆40Updated last year
- Multilingual Dialogue Datasets☆19Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆63Updated 7 months ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021☆27Updated 2 years ago
- ☆15Updated 3 years ago
- An open source toolkit for multimodal generative conversational task assistants, helping assist people with real-world complex tasks☆35Updated 8 months ago
- Contrastive Fact Verification☆71Updated 2 years ago
- A Large-Scale Dataset for Empathetic Response Generation☆41Updated 10 months ago
- ☆33Updated last year
- a tool for calcualting character n-gram F score☆69Updated 2 years ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"☆20Updated 4 years ago
- ☆38Updated 2 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆36Updated last year