ad-freiburg / large-qa-datasets
A collection of large question answering datasets
☆335 Updated 4 months ago
Related projects
Alternatives and complementary repositories for large-qa-datasets
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning ☆681 Updated last year
- Multilingual/multidomain question generation datasets, models, and python library for question generation. ☆324 Updated 2 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ☆458 Updated last month
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆293 Updated 5 months ago
- Expanding natural instructions ☆958 Updated 11 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation ☆193 Updated 9 months ago
- Build Text Rerankers with Deep Language Models ☆251 Updated 8 months ago
- Scalable training for dense retrieval models. ☆270 Updated last year
- ☆333 Updated 11 months ago
- Tevatron - A flexible toolkit for neural retrieval research and development. ☆519 Updated 3 weeks ago
- All-in-one text de-duplication ☆620 Updated 5 months ago
- Long Document Summarization Papers ☆136 Updated last year
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning ☆141 Updated 7 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ☆408 Updated 9 months ago
- [EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models ☆549 Updated last year
- Multilingual Large Language Models Evaluation Benchmark ☆105 Updated 2 months ago
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization) ☆457 Updated 2 years ago
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI ☆477 Updated 6 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆467 Updated 4 months ago
- ☆219 Updated 5 months ago
- Fusion-in-Decoder ☆550 Updated last year
- ☆622 Updated 3 weeks ago
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts" ☆315 Updated 10 months ago
- ☆680 Updated last month
- ☆262 Updated 10 months ago
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award] ☆524 Updated 8 months ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System ☆428 Updated 2 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆1,607 Updated 3 months ago
- ☆166 Updated last year
- Tools for checking ACL paper submissions ☆598 Updated 3 weeks ago