THU-KEG / DICELinks
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
☆11Updated last year
Alternatives and similar repositories for DICE
Users that are interested in DICE are comparing it to the libraries listed below
Sorting:
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Updated last year
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆20Updated 6 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆27Updated last year
- Lightweight Adapting for Black-Box Large Language Models☆24Updated last year
- ☆24Updated 9 months ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27Updated 7 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Updated 2 years ago
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Updated last year
- ☆12Updated last year
- ☆23Updated last year
- ☆51Updated last year
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆73Updated 5 months ago
- ☆31Updated 11 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆23Updated 3 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆72Updated 8 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated last year
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Updated 8 months ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆16Updated 9 months ago
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Updated 5 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆68Updated last year
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆18Updated last year
- ☆20Updated last year
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆34Updated last year
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆127Updated 9 months ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆53Updated 2 weeks ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120Updated 8 months ago
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆33Updated 6 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Updated last year
- Exploration of automated dataset selection approaches at large scales.☆53Updated 10 months ago