orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆40Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for FollowIR
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆57Updated last month
- Official codebase for permutation self-consistency.☆16Updated 9 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆38Updated last month
- ☆39Updated 7 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆13Updated last year
- ☆55Updated last year
- [WWW 2024] The official repo for paper "Scalable and Effective Generative Information Retrieval".☆52Updated 6 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- ☆69Updated last year
- Comprehensive benchmark for RAG☆39Updated 2 weeks ago
- ☆18Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆18Updated 3 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆28Updated 5 months ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆92Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆83Updated 4 months ago
- Retrieval as Attention☆83Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…☆37Updated 4 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆96Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆81Updated 3 weeks ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆49Updated 9 months ago
- ☆44Updated 2 months ago
- ☆29Updated 9 months ago
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]☆13Updated 3 weeks ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆53Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆61Updated 7 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆28Updated 7 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆54Updated 10 months ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆30Updated last year
- Codebase for [Paper] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval☆14Updated 8 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆31Updated 3 months ago