☆25Feb 20, 2026Updated last month
Alternatives and similar repositories for icl-dynamics
Users that are interested in icl-dynamics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated last month
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- ☆31Nov 30, 2025Updated 3 months ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆30Oct 27, 2025Updated 4 months ago
- [NeurIPS2024] Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization; [N…☆21Jul 2, 2025Updated 8 months ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 6 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- Implementation of approximate free-energy minimization in PyTorch☆21Oct 16, 2021Updated 4 years ago
- ☆23Jun 30, 2025Updated 8 months ago
- How do transformer LMs encode relations?☆56Feb 24, 2024Updated 2 years ago
- [ICCV 2023] Black Box Few-Shot Adaptation for Vision-Language models☆26May 14, 2024Updated last year
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆69Updated this week
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆393Jan 7, 2026Updated 2 months ago
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆29Feb 6, 2026Updated last month
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 3 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 7 months ago
- Code from Machine Learning competitions on Kaggle☆11Apr 1, 2021Updated 4 years ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 5 months ago
- All-in-One Safety Evaluation Framwork☆46Mar 4, 2026Updated 2 weeks ago
- Complete set of English dialect transformation rules and evaluation code☆16Jun 7, 2024Updated last year
- Supporting code for the blog post on modular manifolds.☆120Sep 26, 2025Updated 5 months ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆61May 12, 2022Updated 3 years ago
- A remote Scala code evaluator☆14May 16, 2023Updated 2 years ago
- 100M tokens. Infinite compute. Lowest val loss wins.☆310Updated this week
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- ☆15Feb 12, 2025Updated last year
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆207Dec 27, 2024Updated last year
- Pytorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 5 months ago
- ☆13Jun 29, 2024Updated last year
- ☆20Jun 6, 2018Updated 7 years ago
- Library that provides metrics to assess representation quality☆26Feb 5, 2025Updated last year
- Yet another web-based presentation library☆17Jul 5, 2019Updated 6 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated last month
- A source plugin for Gatsby to source Github data from its GraphQL API for static builds☆18Aug 17, 2021Updated 4 years ago
- CIFAR10 ResNets implemented in JAX+Flax☆12Apr 6, 2022Updated 3 years ago
- ☆119Feb 11, 2025Updated last year