trestad / Factual-Recall-Mechanism
The code for the paper "Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models".
☆13 · Updated 10 months ago
Alternatives and similar repositories for Factual-Recall-Mechanism:
Users who are interested in Factual-Recall-Mechanism are comparing it to the repositories listed below:
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse' ☆12 · Updated 6 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆49 · Updated 4 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆50 · Updated 10 months ago
- Code and Data Repo for [ICLR 2025] Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆20 · Updated 2 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆31 · Updated 7 months ago
- A Survey on the Honesty of Large Language Models ☆53 · Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆104 · Updated 10 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA… ☆29 · Updated 2 months ago
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Infe… ☆91 · Updated 3 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆60 · Updated last year
- AnchorAttention: Improved attention for LLMs long-context training ☆205 · Updated last month
- The official repository of the Omni-MATH benchmark. ☆71 · Updated 2 months ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization… ☆14 · Updated last year
- The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark" ☆42 · Updated 3 weeks ago
- [ICLR 24 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style ☆17 · Updated last week
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning". ☆112 · Updated 3 months ago
- ☆70 · Updated last month
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24) ☆16 · Updated 2 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award) ☆84 · Updated 4 months ago
- ☆13 · Updated 7 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆28 · Updated last month
- ☆16 · Updated 4 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆37 · Updated 2 months ago
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆55 · Updated last month
- ☆55 · Updated 3 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation ☆18 · Updated last month