mismayil / crowLinks
Benchmarking Commonsense Reasoning in Real-World Tasks
☆12Updated last year
Alternatives and similar repositories for crow
Users that are interested in crow are comparing it to the libraries listed below
Sorting:
- ☆33Updated 3 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆12Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago
- ☆11Updated last year
- FRANK: Factuality Evaluation Benchmark☆56Updated 2 years ago
- ☆9Updated 2 years ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆71Updated last year
- Code for Handling Divergent Reference Texts when Evaluating Table-to-Text Generation (Dhingra et al. 2019)☆31Updated 4 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated 2 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Updated last year
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆22Updated last year
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆23Updated 3 months ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆34Updated last month
- ☆15Updated 2 years ago
- ☆24Updated 2 years ago
- ☆48Updated 2 years ago
- The corresponding code from our paper " COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion (ACL …☆18Updated 3 years ago
- ☆10Updated 9 months ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- Constrained decoding utilities for text generation using Huggingface seq2seq models☆24Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Textual Style Transfer☆35Updated 2 years ago
- ☆17Updated 3 months ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆26Updated last year
- ☆82Updated 2 years ago
- ☆17Updated last year
- A Python Commonsense Knowledge Inference Toolkit☆64Updated last year
- ☆39Updated 2 years ago