orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆39Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for FollowIR
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆56Updated 2 weeks ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆37Updated 2 weeks ago
- ☆37Updated 6 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- Official codebase for permutation self-consistency.☆16Updated 8 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆84Updated 3 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆13Updated last year
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆51Updated 3 weeks ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆18Updated 2 months ago
- Evaluate the Quality of Critique☆35Updated 5 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆59Updated 6 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆52Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models☆47Updated last month
- Repo for "On Learning to Summarize with Large Language Models as References"☆42Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆67Updated 5 months ago
- [WWW 2024] The official repo for paper "Scalable and Effective Generative Information Retrieval".☆51Updated 6 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆54Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆48Updated 8 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆30Updated 2 months ago
- ☆55Updated last year
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆27Updated 7 months ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆92Updated last year
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆30Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆54Updated 10 months ago
- ☆69Updated last year
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]☆12Updated last week
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆42Updated 2 months ago
- AbstainQA, ACL 2024☆19Updated 3 weeks ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆61Updated 3 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆30Updated 3 months ago