model-similarity / lm-similarity
☆14Updated last month
Alternatives and similar repositories for lm-similarity:
Users that are interested in lm-similarity are comparing it to the libraries listed below
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging☆19Updated last month
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆26Updated last month
- ☆30Updated 2 months ago
- Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More☆25Updated last month
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆75Updated 5 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆23Updated last week
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆35Updated 4 months ago
- We study toy models of skill learning.☆24Updated 2 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆84Updated 4 months ago
- ☆16Updated last month
- ☆74Updated 7 months ago
- ☆22Updated last month
- ☆13Updated last year
- The repository contains code for Adaptive Data Optimization☆20Updated 3 months ago
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"☆13Updated last week
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆19Updated 3 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆24Updated 5 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆62Updated last week
- ☆60Updated 11 months ago
- ☆18Updated 8 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 6 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆24Updated 4 months ago
- ☆32Updated 3 weeks ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆10Updated last month
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆60Updated 2 months ago
- ☆16Updated 2 months ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆71Updated 2 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆22Updated last week
- Knowledge Unlearning for Large Language Models☆20Updated 2 weeks ago
- ☆18Updated this week