☆25Feb 20, 2026Updated last week
Alternatives and similar repositories for icl-dynamics
Users that are interested in icl-dynamics are comparing it to the libraries listed below
Sorting:
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆21Feb 19, 2026Updated last week
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- Welcome to the 'In Context Learning Theory' Reading Group☆30Nov 8, 2024Updated last year
- ☆29Nov 30, 2025Updated 3 months ago
- 100M tokens, no time limit, best val loss wins!☆103Updated this week
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- ☆23Jun 30, 2025Updated 8 months ago
- Code for paper "Robustness of Bayesian Neural Networks to Gradient-Based Attacks"☆17Feb 26, 2024Updated 2 years ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆98Dec 2, 2024Updated last year
- ☆46Jul 21, 2025Updated 7 months ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 6 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- Implementation of approximate free-energy minimization in PyTorch☆21Oct 16, 2021Updated 4 years ago
- How do transformer LMs encode relations?☆56Feb 24, 2024Updated 2 years ago
- The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.☆22Jun 9, 2022Updated 3 years ago
- [ICCV 2023] Black Box Few-Shot Adaptation for Vision-Language models☆26May 14, 2024Updated last year
- Benchmarking Optimizers for LLM Pretraining☆52Dec 30, 2025Updated 2 months ago
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆29Feb 6, 2026Updated 3 weeks ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆63Updated this week
- ☆56Sep 17, 2025Updated 5 months ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆392Jan 7, 2026Updated last month
- ☆35Jul 5, 2023Updated 2 years ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆34Mar 8, 2025Updated 11 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’☆21Dec 19, 2025Updated 2 months ago
- Data access library for the MeerKAT radio telescope☆13Jan 21, 2026Updated last month
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- ☆52Oct 23, 2023Updated 2 years ago
- Physics-inspired transformer modules based on mean-field dynamics of vector-spin models in JAX☆46Dec 10, 2023Updated 2 years ago
- PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)☆37Oct 30, 2024Updated last year
- ☆14Apr 20, 2021Updated 4 years ago
- Release code for "A Bayesian formulation for estimating the composition of Earth's crust"☆10Apr 16, 2023Updated 2 years ago
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 3 years ago
- Materials for Structural geology 2 course☆11Nov 11, 2019Updated 6 years ago
- Statistician is a framework of tools for generating statistical summaries of large collections of EO data managed in an ODC instance.☆12Jan 27, 2026Updated last month
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆10Nov 15, 2024Updated last year
- ☆12May 18, 2025Updated 9 months ago
- Project overview, roadmap and initial result reports☆11Aug 6, 2022Updated 3 years ago