mukhal / intrinsic-source-citation
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆16Updated 6 months ago
Alternatives and similar repositories for intrinsic-source-citation:
Users that are interested in intrinsic-source-citation are comparing it to the libraries listed below
- Interview-based evaluation of LLMs☆15Updated last month
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆42Updated 7 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆17Updated 2 weeks ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- ☆26Updated 8 months ago
- Adding new tasks to T0 without catastrophic forgetting☆32Updated 2 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- ☆11Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated 8 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆46Updated last year
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Updated 2 years ago
- https://arxiv.org/abs/2404.10917☆14Updated 8 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆22Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing☆20Updated last month
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated 11 months ago
- ☆15Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- ☆49Updated 3 months ago
- ☆40Updated last week
- ☆55Updated 2 years ago
- Pre-train Static Word Embeddings☆47Updated 3 weeks ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆34Updated 2 months ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆29Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)☆44Updated 3 years ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆27Updated 6 months ago