Rachum-thu / LongPiBench
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆12Updated 4 months ago
Alternatives and similar repositories for LongPiBench:
Users that are interested in LongPiBench are comparing it to the libraries listed below
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 6 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆86Updated last month
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging☆20Updated 2 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆43Updated 3 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆50Updated 2 months ago
- ☆9Updated last year
- The official implementation of Cross-Task Experience Sharing (COPS)☆22Updated 6 months ago
- The first dense retrieval model that can be prompted like an LM☆71Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 10 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆84Updated last month
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆24Updated last year
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆46Updated 2 months ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆30Updated last week
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆36Updated last year
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆36Updated 9 months ago
- Large language models for document ranking.☆48Updated last week
- The code implementation of Symbolic-MoE☆27Updated last month
- The official implementation of Self-Exploring Language Models (SELM)☆63Updated 10 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆112Updated 11 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆40Updated 4 months ago
- ☆13Updated 4 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆48Updated last month
- The original Shared Recurrent Memory Transformer implementation☆23Updated 3 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆12Updated 5 months ago
- Automated Qualitative Analysis of LLMs (ICLR 2025)☆35Updated 2 weeks ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆40Updated 2 months ago
- ☆62Updated 3 weeks ago