mukhal / intrinsic-source-citationLinks

[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models

☆19

Alternatives and similar repositories for intrinsic-source-citation

Users that are interested in intrinsic-source-citation are comparing it to the libraries listed below

Sorting:

google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated 2 years ago
kaistAI / InstructIR
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆31Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆31Updated 10 months ago
stanfordnlp / multi-distribution-retrieval
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆15Updated last year
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆91Updated last year
mungg / FABLES
☆58Updated last year
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated 5 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆45Updated last month
allenai / super-benchmark
☆49Updated 7 months ago
nathanhu0 / CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25Updated last year
zhichaoxu-shufe / context-aware-decoding-qfs
☆14Updated last year
allenai / bff
☆39Updated last year
AlexWan0 / infini-gram
An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)
☆33Updated last year
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆29Updated last week
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆22Updated 4 months ago
abhika-m / FAVA
☆75Updated last year
marzenakrp / nocha
☆53Updated last year
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆65Updated last year
allenai / SciRIFF
Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.
☆44Updated 8 months ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated 2 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59Updated 2 years ago
ShiZhengyan / PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…
☆76Updated last year
interview-eval / interview-eval
Interview-based evaluation of LLMs
☆22Updated 10 months ago
csinva / iprompt
Finding semantically meaningful and accurate prompts.
☆48Updated 2 years ago
OSU-NLP-Group / In-Context-Reranking
[ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"
☆36Updated 7 months ago
lucy3 / whos_filtered
☆14Updated last year
oriram / spider
☆54Updated 2 years ago
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Updated 2 years ago
katzurik / NERetrieve
☆30Updated last year
jaehunjung1 / cascaded-selective-evaluation
☆26Updated 9 months ago