xlang-ai / BRIGHTView external linksLinks
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆189Sep 13, 2025Updated 5 months ago
Alternatives and similar repositories for BRIGHT
Users that are interested in BRIGHT are comparing it to the libraries listed below
Sorting:
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆218Jun 24, 2025Updated 7 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated last month
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆33Oct 11, 2025Updated 4 months ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆175Dec 11, 2025Updated 2 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- Model implementation for the contextual embeddings project☆40Jun 2, 2025Updated 8 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 10 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆52Jul 3, 2024Updated last year
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆19Mar 31, 2025Updated 10 months ago
- Document Ranking with Large Language Models.☆202Updated this week
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Jan 6, 2026Updated last month
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆576Updated this week
- Generative Representational Instruction Tuning☆686Jun 25, 2025Updated 7 months ago
- ☆161Apr 17, 2024Updated last year
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated last year
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆50Feb 10, 2025Updated last year
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆218Nov 27, 2025Updated 2 months ago
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Oct 26, 2023Updated 2 years ago
- ☆16Sep 17, 2024Updated last year
- [TOIS 2025] This is the official repository for the generative information retrieval survey.☆201Apr 5, 2025Updated 10 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆50Dec 7, 2024Updated last year
- ☆11Mar 13, 2023Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 3 weeks ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Oct 14, 2025Updated 4 months ago
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆25Jun 6, 2025Updated 8 months ago
- official repository for ListT5☆48Nov 27, 2025Updated 2 months ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆346Dec 16, 2024Updated last year
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking☆41Dec 12, 2025Updated 2 months ago
- PreRanker: reranking tools before tool-use☆21Apr 9, 2025Updated 10 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,021Nov 13, 2025Updated 3 months ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆63Dec 10, 2025Updated 2 months ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆136Aug 26, 2024Updated last year
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆723Jan 26, 2026Updated 3 weeks ago
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 8 months ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆31Nov 30, 2025Updated 2 months ago
- code for training & evaluating Contextual Document Embedding models☆202May 14, 2025Updated 9 months ago