ad-freiburg / large-qa-datasets
A collection of large question answering datasets
☆358Updated 7 months ago
Alternatives and similar repositories for large-qa-datasets:
Users who are interested in large-qa-datasets are comparing it to the repositories listed below.
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ☆474Updated 4 months ago
- Multilingual/multidomain question generation datasets, models, and Python library for question generation.☆344Updated 5 months ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆709Updated last year
- Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03…☆526Updated last year
- Tevatron - A flexible toolkit for neural retrieval research and development.☆557Updated this week
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆492Updated 7 months ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆291Updated last year
- Build Text Rerankers with Deep Language Models☆257Updated 11 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆340Updated 2 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆318Updated 8 months ago
- Expanding natural instructions☆973Updated last year
- Scalable training for dense retrieval models.☆275Updated last year
- Efficient Attention for Long Sequence Processing☆92Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆328Updated last year
- Fusion-in-Decoder☆559Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆195Updated last year
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI☆490Updated 2 weeks ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,699Updated last week
- Dense Passage Retriever is a set of tools and models for the open-domain Q&A task.☆1,749Updated last year
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆431Updated 2 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆259Updated last year
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆151Updated 10 months ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆603Updated 2 years ago
- Tools for checking ACL paper submissions☆646Updated 3 months ago
- A gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆347Updated last year
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,741Updated last week