amazon-science / mintaka
Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)
☆104Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for mintaka
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆99Updated last year
- ☆55Updated last year
- ☆67Updated 3 years ago
- ☆95Updated last year
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆67Updated last year
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆84Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆40Updated 11 months ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆136Updated last year
- ☆97Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆72Updated 2 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆40Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆92Updated last year
- ☆29Updated 9 months ago
- Contrastive Fact Verification☆70Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆63Updated 2 years ago
- A Python Commonsense Knowledge Inference Toolkit☆63Updated 11 months ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆58Updated 2 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆102Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆122Updated 8 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆54Updated 6 months ago
- ☆44Updated last year
- A multilingual version of MS MARCO passage ranking dataset☆142Updated last year
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 4 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆153Updated 2 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 2 years ago
- ☆33Updated last year
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.☆48Updated 3 years ago
- Token-level Reference-free Hallucination Detection☆93Updated last year