kaistAI/InstructIR



InstructIR is a novel benchmark specifically designed to evaluate the instruction-following ability of information retrieval models. The benchmark focuses on user-aligned instructions tailored to each query instance.

☆32

Alternatives and similar repositories for InstructIR

Users that are interested in InstructIR are comparing it to the libraries listed below.


kaistAI / KtrlF
[NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"
☆23 · Oct 11, 2024 · Updated last year
naver-ai / ALMoST
☆24 · Dec 2, 2023 · Updated 2 years ago
soheeyang / unified-prompt-selection
[TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
☆11 · Nov 14, 2024 · Updated last year
kaistAI / How-Well-Do-LLMs-Truly-Ground
☆11 · Sep 19, 2025 · Updated 10 months ago
joeljang / FLM
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15 · Mar 8, 2023 · Updated 3 years ago
kaistAI / factual-knowledge-acquisition
☆25 · Dec 12, 2025 · Updated 7 months ago
amy-hyunji / Contextualized-Generative-Retrieval
☆16 · Oct 6, 2022 · Updated 3 years ago
kaistAI / GAP
[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization
☆29 · Sep 12, 2024 · Updated last year
amy-hyunji / Generative-Multihop-Retrieval
☆33 · Mar 31, 2023 · Updated 3 years ago
kaistAI / Knowledge-Entropy
[ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
☆17 · Nov 25, 2024 · Updated last year
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆56 · Jul 3, 2024 · Updated 2 years ago
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99 · Apr 26, 2023 · Updated 3 years ago
DM2-ND / EDMem
Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"
☆15 · Apr 24, 2023 · Updated 3 years ago
kaistAI / Volcano
[NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…
☆49 · Aug 21, 2024 · Updated last year
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆117 · Jun 28, 2025 · Updated last year
aviaefrat / lmentry
☆15 · Nov 22, 2023 · Updated 2 years ago
MattYoon / reasoning-models-confidence
[NeurIPS 2025] "Reasoning Models Better Express Their Confidence"
☆23 · Nov 19, 2025 · Updated 8 months ago
HansiZeng / scaling-retriever
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22 · Mar 31, 2025 · Updated last year
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆53 · Aug 10, 2025 · Updated 11 months ago
RulinShao / massive-serve
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆26 · Jun 6, 2025 · Updated last year
joeljang / Pretraining_T5_custom_dataset
Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints
☆38 · Mar 21, 2021 · Updated 5 years ago
kaistAI / LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
☆97 · Oct 30, 2024 · Updated last year
naver-ai / elva
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …
☆20 · Mar 13, 2026 · Updated 4 months ago
neulab / data-agora
[ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆40 · Dec 13, 2024 · Updated last year
lilakk / BLEUBERI
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆32 · Jun 5, 2025 · Updated last year
AkariAsai / evidentiality_qa
The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆44 · Dec 25, 2022 · Updated 3 years ago
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79 · Sep 13, 2024 · Updated last year
facebookresearch / tart
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
☆167 · Oct 4, 2023 · Updated 2 years ago
dayeonki / mt_feedback
Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024]
☆14 · Apr 3, 2026 · Updated 3 months ago
prometheus-eval / prometheus-vision
[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…
☆86 · Sep 13, 2024 · Updated last year
SeungoneKim / SICK_Summarization
[COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization
☆25 · Mar 28, 2024 · Updated 2 years ago
guijinSON / MM-Eval
Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"
☆20 · Oct 26, 2024 · Updated last year
zhijing-jin / genwiki
Dataset for the paper "GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation"
☆25 · Jan 2, 2024 · Updated 2 years ago
prometheus-eval / prometheus
[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…
☆323 · Nov 11, 2023 · Updated 2 years ago
hangeol / UniR
Official repo for paper: Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
☆20 · Nov 26, 2025 · Updated 7 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆93 · May 8, 2025 · Updated last year
LHRYANG / Generalization_of_FT-LLM
Implementation of NAACL 2024 paper Unveiling the Generalization Power of Fine-Tuned Large Language Models
☆11 · Mar 14, 2024 · Updated 2 years ago
kaist-ami / BEAF
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆22 · Mar 26, 2025 · Updated last year
skywalker023 / fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
☆62 · May 31, 2024 · Updated 2 years ago