mukhal / intrinsic-source-citation
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆16 · Updated 5 months ago
Alternatives and similar repositories for intrinsic-source-citation:
Users who are interested in intrinsic-source-citation are comparing it to the repositories listed below
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆17 · Updated 3 months ago
- Aioli: A unified optimization framework for language model data mixing ☆18 · Updated this week
- Targeted Data Generation with Large Language Models ☆14 · Updated 6 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆45 · Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆40 · Updated 6 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆80 · Updated 10 months ago
- ☆46 · Updated 2 months ago
- ☆16 · Updated 6 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 · Updated last week
- ☆46 · Updated 2 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆28 · Updated 6 months ago
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations… ☆27 · Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆46 · Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs ☆34 · Updated 11 months ago
- ☆38 · Updated 3 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆40 · Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 · Updated 10 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆30 · Updated last month
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc… ☆31 · Updated 7 months ago
- ☆11 · Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆40 · Updated last month
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 · Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆76 · Updated last year
- ☆48 · Updated 11 months ago
- ☆38 · Updated 9 months ago
- ☆27 · Updated 2 months ago
- ☆19 · Updated 2 months ago
- ☆56 · Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆79 · Updated 10 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 · Updated last year