bryanchrist / MathNeuro
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆20Updated 6 months ago
Alternatives and similar repositories for MathNeuro
Users who are interested in MathNeuro are comparing it to the repositories listed below.
- ☆23Updated last year
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆17Updated last year
- Exploration of automated dataset selection approaches at large scales.☆50Updated 9 months ago
- ☆23Updated last year
- ☆25Updated 8 months ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆118Updated 7 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 11 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆37Updated last year
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Updated 10 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆21Updated last year
- ☆22Updated 4 months ago
- ☆51Updated 10 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆23Updated 2 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Updated 2 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Updated 9 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆31Updated 4 months ago
- ☆14Updated 10 months ago
- ☆56Updated last year
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆56Updated 10 months ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆27Updated 11 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆123Updated 8 months ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆26Updated last month
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆52Updated 4 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆25Updated 11 months ago
- A holistic benchmark for LLM abstention☆64Updated 3 months ago
- ☆17Updated 4 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆106Updated 6 months ago