gowitheflow-1998 / RAR-b
☆22Updated last year
Alternatives and similar repositories for RAR-b
Users who are interested in RAR-b are comparing it to the libraries listed below.
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆78Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆179Updated 2 months ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆120Updated last month
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆125Updated last year
- Official codebase for permutation self-consistency.☆18Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆123Updated last year
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆143Updated last year
- 🌲 Code for our EMNLP 2023 paper - 🎄 "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode…☆52Updated 2 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Updated last year
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆134Updated last year
- Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table☆28Updated 2 years ago
- ☆58Updated last year
- [NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆62Updated 2 years ago
- The repository for paper <Evaluating Open-QA Evaluation>☆25Updated last year
- Code and data for the FACTOR paper☆52Updated 2 years ago
- A comprehensive paper list of Reasoning over Tables.☆29Updated 3 years ago
- ☆189Updated 5 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆132Updated 9 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆60Updated 6 months ago
- ☆31Updated 9 months ago
- ☆130Updated 2 years ago
- contrastive decoding☆204Updated 3 years ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆75Updated last year
- GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆61Updated last year
- paper list on reasoning in NLP☆194Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆148Updated last year
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆40Updated 11 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆49Updated last year
- ☆34Updated 3 weeks ago
- Scaling Sentence Embeddings with Large Language Models☆109Updated last year