MadryLab / DsDmLinks
β51Updated 2 years ago
Alternatives and similar repositories for DsDm
Users that are interested in DsDm are comparing it to the libraries listed below
Sorting:
- Language models scale reliably with over-training and on downstream tasksβ99Updated last year
- AI Logging for Interpretability and Explainabilityπ¬β140Updated last year
- β103Updated 2 years ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)β42Updated 3 weeks ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Updated last year
- Test-time-training on nearest neighbors for large language modelsβ49Updated last year
- β108Updated last year
- β41Updated 2 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Modelsβ48Updated 2 years ago
- Code accompanying the paper "Massive Activations in Large Language Models"β195Updated last year
- [NeurIPS'24 Spotlight] Observational Scaling Lawsβ58Updated last year
- β53Updated 9 months ago
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)β32Updated 4 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervisionβ124Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)β81Updated 2 years ago
- Replicating O1 inference-time scaling lawsβ92Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]β147Updated last year
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)β84Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]β21Updated last year
- β62Updated 8 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"β23Updated 9 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".β64Updated 5 months ago
- Lightweight Adapting for Black-Box Large Language Modelsβ25Updated last year
- β51Updated 2 years ago
- Function Vectors in Large Language Models (ICLR 2024)β190Updated 9 months ago
- β80Updated 3 years ago
- β20Updated 3 months ago
- β74Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewardsβ47Updated 9 months ago
- β208Updated 2 years ago