AbhilashaRavichander / information-probing
☆11 Updated 4 months ago
Alternatives and similar repositories for information-probing
Users who are interested in information-probing are comparing it to the libraries listed below.
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- ☆81 Updated 2 weeks ago
- ☆22 Updated 6 months ago
- ☆54 Updated 10 months ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25) ☆24 Updated last month
- Exploring Model Kinship for Merging Large Language Models ☆26 Updated 5 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆77 Updated 6 months ago
- The first dense retrieval model that can be prompted like an LM ☆87 Updated 4 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆108 Updated 11 months ago
- Functional Benchmarks and the Reasoning Gap ☆88 Updated 11 months ago
- ☆122 Updated 7 months ago
- ☆38 Updated 5 months ago
- ☆27 Updated 3 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆35 Updated 6 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆55 Updated 11 months ago
- ☆73 Updated 2 months ago
- Evaluating LLMs with fewer examples ☆161 Updated last year
- ☆35 Updated 4 months ago
- ☆21 Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆34 Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning ☆72 Updated 5 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆103 Updated 5 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps" ☆130 Updated last year
- A repository for research on medium sized language models. ☆77 Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆99 Updated 3 weeks ago
- ☆57 Updated 11 months ago
- ☆56 Updated 2 months ago
- ☆127 Updated 11 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆105 Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year