socialfoundations / tttlm
Test-time-training on nearest neighbors for large language models
☆39Updated last year
Alternatives and similar repositories for tttlm:
Users that are interested in tttlm are comparing it to the libraries listed below
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆50Updated last month
- Code for "Reasoning to Learn from Latent Thoughts"☆89Updated 3 weeks ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆50Updated 3 weeks ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 7 months ago
- ☆16Updated last week
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆54Updated 6 months ago
- ☆50Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆25Updated 4 months ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆22Updated last week
- ☆47Updated last year
- Exploration of automated dataset selection approaches at large scales.☆37Updated last month
- ☆39Updated last year
- Learning adapter weights from task descriptions☆16Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆53Updated 4 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆62Updated 6 months ago
- ☆89Updated 3 weeks ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 5 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆38Updated 3 weeks ago
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- ☆50Updated this week
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆17Updated 11 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆57Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆24Updated 8 months ago
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆16Updated last year
- ☆37Updated last year
- Codebase for decoding compressed trust.☆23Updated 11 months ago
- ☆23Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆114Updated 3 weeks ago
- ☆62Updated 4 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆36Updated 9 months ago