bryanchrist / MathNeuroLinks
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆20Updated 6 months ago
Alternatives and similar repositories for MathNeuro
Users that are interested in MathNeuro are comparing it to the libraries listed below
Sorting:
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆17Updated last year
- ☆23Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆118Updated 7 months ago
- ☆23Updated last year
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆11Updated last year
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆38Updated last year
- ☆24Updated 8 months ago
- Exploration of automated dataset selection approaches at large scales.☆52Updated 9 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆56Updated 10 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆105Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- ☆17Updated 4 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated last year
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Updated last year
- ☆22Updated 5 months ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆27Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Updated last year
- ☆56Updated last year
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆25Updated last year
- ☆51Updated last year
- ☆18Updated 5 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Updated 4 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆63Updated 4 months ago
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆19Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆23Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆126Updated last year
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆46Updated 7 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- ☆29Updated last year
- A holistic benchmark for LLM abstention☆67Updated 4 months ago