QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking
☆38Jan 20, 2026Updated 2 months ago
Alternatives and similar repositories for QRHead
Users that are interested in QRHead are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆33Feb 26, 2026Updated last month
- ☆14Jan 10, 2024Updated 2 years ago
- ☆11Mar 25, 2022Updated 4 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Jan 6, 2026Updated 3 months ago
- SIGIR'20: An Analysis of BERT in Document Ranking☆21Jul 27, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [EMNLP2020] End-to-End Emotion-Cause Pair Extraction based on SlidingWindow Multi-Label Learning☆20Oct 13, 2020Updated 5 years ago
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…☆11Oct 16, 2022Updated 3 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆50Jun 30, 2025Updated 9 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 4 months ago
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Sep 18, 2024Updated last year
- IRFL: Image Recognition of Figurative Language☆11Nov 30, 2023Updated 2 years ago
- The official repository of MM-R5☆29Jun 22, 2025Updated 9 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆13Oct 14, 2024Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆42Mar 31, 2025Updated last year
- ☆29Oct 8, 2025Updated 6 months ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆24Jun 28, 2025Updated 9 months ago
- ☆14Jun 9, 2017Updated 8 years ago
- ☆19May 19, 2024Updated last year
- Mixture of Lora Experts☆10Apr 7, 2024Updated 2 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- ☆14Jul 17, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆25Apr 10, 2025Updated last year
- Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction☆11Oct 19, 2020Updated 5 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- Benchmark datasets for sentiment analysis☆11May 18, 2020Updated 5 years ago
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆30Mar 26, 2024Updated 2 years ago
- Distribution Aware Tuning☆16Aug 29, 2024Updated last year
- Source code for the paper 'Complex Hyperbolic Knowledge Graph Embeddings with Fast Fourier Transform'.☆12Nov 9, 2022Updated 3 years ago
- ☆11Oct 9, 2022Updated 3 years ago
- ☆18Mar 30, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆237Aug 2, 2024Updated last year
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆250Sep 12, 2025Updated 7 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆55Apr 3, 2026Updated 2 weeks ago
- Source code for the paper "Controlling the Risk of Conversational Search via Reinforcement Learning" and "Simulating and Modeling the Ris…☆12Aug 11, 2023Updated 2 years ago
- On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification☆29Nov 30, 2022Updated 3 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- ☆18Jul 11, 2021Updated 4 years ago