MadryLab / DsDm
☆49 Updated last year
Alternatives and similar repositories for DsDm
Users who are interested in DsDm are comparing it to the repositories listed below
- AI Logging for Interpretability and Explainability ☆124 Updated last year
- Test-time-training on nearest neighbors for large language models ☆44 Updated last year
- ☆95 Updated last year
- ☆38 Updated last year
- ☆48 Updated 2 months ago
- Language models scale reliably with over-training and on downstream tasks ☆97 Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆59 Updated this week
- PASTA: Post-hoc Attention Steering for LLMs ☆121 Updated 7 months ago
- Code accompanying the paper "Massive Activations in Large Language Models" ☆169 Updated last year
- ☆51 Updated 3 months ago
- ☆43 Updated last year
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin… ☆51 Updated last year
- ☆45 Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆46 Updated last year
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?" ☆91 Updated 2 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆123 Updated 10 months ago
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM) ☆89 Updated last year
- Code for "Reasoning to Learn from Latent Thoughts" ☆112 Updated 3 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆138 Updated 9 months ago
- A Sober Look at Language Model Reasoning ☆75 Updated 3 weeks ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆77 Updated 6 months ago
- ☆87 Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆78 Updated last year
- ☆70 Updated 3 years ago
- Function Vectors in Large Language Models (ICLR 2024) ☆170 Updated 2 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 Updated 3 months ago
- ☆96 Updated 9 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆38 Updated 8 months ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024] ☆17 Updated last year
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p… ☆11 Updated 5 months ago