trestad / Factual-Recall-MechanismLinks
The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.
☆13Updated last year
Alternatives and similar repositories for Factual-Recall-Mechanism
Users that are interested in Factual-Recall-Mechanism are comparing it to the libraries listed below
Sorting:
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆13Updated 10 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 6 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆89Updated last week
- ☆9Updated 6 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆56Updated 5 months ago
- ☆13Updated last year
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆48Updated 2 weeks ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆51Updated this week
- ☆131Updated 3 weeks ago
- Extending context length of visual language models☆11Updated 5 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆69Updated 3 months ago
- ☆46Updated 7 months ago
- A Survey on the Honesty of Large Language Models☆57Updated 5 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Updated last year
- ☆24Updated 2 years ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆69Updated last week
- A Sober Look at Language Model Reasoning☆63Updated last week
- 📜 Paper list on decoding methods for LLMs and LVLMs☆48Updated last month
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆32Updated 4 months ago
- awesome SAE papers☆34Updated last week
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆41Updated 2 weeks ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆85Updated 7 months ago
- SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆15Updated 2 months ago
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆19Updated 8 months ago
- ☆52Updated last week
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year
- ☆45Updated last month
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆72Updated 2 years ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆111Updated last year
- ☆26Updated last year