mukhal / intrinsic-source-citationLinks
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆19Updated 8 months ago
Alternatives and similar repositories for intrinsic-source-citation
Users that are interested in intrinsic-source-citation are comparing it to the libraries listed below
Sorting:
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated 2 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆47Updated 9 months ago
- Aioli: A unified optimization framework for language model data mixing☆31Updated 11 months ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- ☆53Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 6 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Updated 5 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated 2 years ago
- ☆49Updated 8 months ago
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…☆55Updated 4 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆15Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Updated 2 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆33Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆25Updated 6 months ago
- Few-shot Learning with Auxiliary Data☆31Updated 2 years ago
- ☆14Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆40Updated 8 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆49Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Updated 2 years ago
- ☆59Updated last year
- Embedding Recycling for Language models☆38Updated 2 years ago
- code associated with WANLI dataset in Liu et al., 2022☆31Updated 2 years ago
- ☆13Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Updated 2 years ago
- ☆44Updated last year
- ☆46Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 3 years ago
- ☆24Updated last year