socialfoundations / tttlmLinks
Test-time-training on nearest neighbors for large language models
☆48Updated last year
Alternatives and similar repositories for tttlm
Users that are interested in tttlm are comparing it to the libraries listed below
Sorting:
- A Sober Look at Language Model Reasoning☆89Updated 2 weeks ago
- Code for "Reasoning to Learn from Latent Thoughts"☆122Updated 8 months ago
- ☆51Updated last year
- ☆102Updated 2 years ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆187Updated last year
- ☆51Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆62Updated 3 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆37Updated last year
- ☆18Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Updated last year
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆59Updated last year
- Function Vectors in Large Language Models (ICLR 2024)☆186Updated 7 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆142Updated 4 months ago
- AI Logging for Interpretability and Explainability🔬☆133Updated last year
- ☆52Updated 7 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆48Updated 6 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆150Updated 5 months ago
- ☆80Updated 3 years ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆31Updated 10 months ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆46Updated 8 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆119Updated 11 months ago
- ☆46Updated last year
- ☆43Updated 2 years ago
- GenRM-CoT: Data release for verification rationales☆66Updated last year
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆105Updated last month
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆63Updated last year
- ☆134Updated 8 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆123Updated last year
- Learning adapter weights from task descriptions☆19Updated 2 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Updated last year