liuqi6777 / pe_rank
Leveraging passage embeddings for efficient listwise reranking with large language models.
☆46 Updated 9 months ago
Alternatives and similar repositories for pe_rank
Users who are interested in pe_rank are comparing it to the repositories listed below.
- ☆52 Updated 7 months ago
- Test-time compute in information retrieval ☆42 Updated 2 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆173 Updated 2 months ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent ☆77 Updated last week
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆61 Updated 9 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking ☆89 Updated last month
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆164 Updated 3 months ago
- ☆58 Updated 10 months ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆155 Updated last month
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning ☆84 Updated last year
- Counting-Stars (★) ☆83 Updated 3 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆152 Updated last year
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation ☆58 Updated 3 months ago
- ☆105 Updated last month
- Code implementation of synthetic continued pretraining ☆127 Updated 8 months ago
- ☆49 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆147 Updated 10 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆120 Updated 7 months ago
- ☆29 Updated last week
- Towards Systematic Measurement for Long Text Quality ☆37 Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆116 Updated last year
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar… ☆52 Updated 10 months ago
- ☆58 Updated 10 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆185 Updated last year
- ☆68 Updated 2 years ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory ☆62 Updated 2 years ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆58 Updated 11 months ago
- ☆96 Updated 8 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆51 Updated 3 months ago
- ☆18 Updated last year