aadityasingh/icl-dynamics

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aadityasingh/icl-dynamics)

aadityasingh / icl-dynamics

☆26

Alternatives and similar repositories for icl-dynamics

Users that are interested in icl-dynamics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fiveai / understanding_safety_finetuning
View on GitHub
Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)
☆12Oct 31, 2024Updated last year
alex-damian / EOS
View on GitHub
☆15Sep 29, 2022Updated 3 years ago
fjzzq2002 / WeightWatch
View on GitHub
Official Repository of Paper "Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs"
☆15Sep 25, 2025Updated 9 months ago
zzp1012 / Cross-Task-Linearity
View on GitHub
[ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"
☆11Feb 20, 2025Updated last year
Nix07 / finetuning
View on GitHub
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…
☆32Oct 27, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
GeneralUserModels / napsack
View on GitHub
☆16Apr 4, 2026Updated 3 months ago
Ber666 / reasoning-by-superposition
View on GitHub
Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)
☆44Oct 8, 2025Updated 9 months ago
AnonymousNIPS2019 / DeepnetHessian
View on GitHub
The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size
☆19May 19, 2019Updated 7 years ago
damek / specgd
View on GitHub
Code to generate figures of paper "When do spectral gradient updates help in deep learning?"
☆16Dec 3, 2025Updated 7 months ago
anadim / smallest-addition-transformer-claude-code
View on GitHub
6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.
☆22Feb 19, 2026Updated 5 months ago
EleutherAI / mdl
View on GitHub
Minimum Description Length probing for neural network representations
☆20Jan 28, 2025Updated last year
ginevracoal / robustBNNs
View on GitHub
Code for paper "Robustness of Bayesian Neural Networks to Gradient-Based Attacks"
☆17Feb 26, 2024Updated 2 years ago
zzp1012 / SAM-in-Late-Phase
View on GitHub
[ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"
☆19Feb 20, 2025Updated last year
OscarXZQ / delta_activations
View on GitHub
Official code release for Delta Activations: A Representation for Finetuned Large Language Models
☆20Sep 5, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Furyton / awesome-language-model-analysis
View on GitHub
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…
☆101Updated this week
evandez / relations
View on GitHub
How do transformer LMs encode relations?
☆59Feb 24, 2024Updated 2 years ago
saic-fi / LFA
View on GitHub
[ICCV 2023] Black Box Few-Shot Adaptation for Vision-Language models
☆27May 14, 2024Updated 2 years ago
koayon / atp_star
View on GitHub
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20Jan 19, 2025Updated last year
Thinklab-SJTU / Fast-T2T
View on GitHub
[NeurIPS2024] Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization; [N…
☆22Jul 2, 2025Updated last year
LeiBAI / Paper-Writing-Rebuttal
View on GitHub
Some thoughts about writing scientific papers
☆23Nov 8, 2024Updated last year
HumanCompatibleAI / leela-interp
View on GitHub
Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
☆31Jun 4, 2024Updated 2 years ago
allenbai01 / transformers-as-statisticians
View on GitHub
☆35Jul 5, 2023Updated 3 years ago
tim-lawson / mlsae
View on GitHub
Multi-Layer Sparse Autoencoders (ICLR 2025)
☆30Feb 6, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
thinking-machines-lab / manifolds
View on GitHub
Supporting code for the blog post on modular manifolds.
☆126Sep 26, 2025Updated 9 months ago
n2cholas / progan-flax
View on GitHub
Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation
☆12May 24, 2021Updated 5 years ago
facebookresearch / SIMAT
View on GitHub
codebase for the SIMAT dataset and evaluation
☆39Feb 16, 2022Updated 4 years ago
UKPLab / cdcr-beyond-corpus-tailored
View on GitHub
📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora
☆10May 25, 2022Updated 4 years ago
Nebularaid2000 / bottleneck
View on GitHub
PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)
☆37Oct 30, 2024Updated last year
keyonvafa / inductive-bias-probes
View on GitHub
☆34Nov 30, 2025Updated 7 months ago
understanding-search / maze-transformer
View on GitHub
This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.
☆35Oct 28, 2025Updated 8 months ago
MikaStars39 / FeatureAlignment
View on GitHub
FeatureAlignment = Alignment + Mechanistic Interpretability
☆35Mar 8, 2025Updated last year
interp-reasoning / thought-anchors
View on GitHub
⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.
☆137Oct 27, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nblt / F-SAM
View on GitHub
[CVPR 2024] Friendly Sharpness-Aware Minimization
☆37Oct 29, 2024Updated last year
diicellman / dynamite-dogs
View on GitHub
BH hackathon
☆14Apr 4, 2024Updated 2 years ago
science-of-finetuning / diffing-toolkit
View on GitHub
A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.
☆78Updated this week
kayoyin / interpret-lm
View on GitHub
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
☆63May 12, 2022Updated 4 years ago
yash-srivastava19 / arrakis
View on GitHub
Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.
☆31Jul 8, 2026Updated 2 weeks ago
huanranchen / LLMLandscape
View on GitHub
The loss landscape of Large Language Models resemble basin!
☆41Jul 8, 2025Updated last year
tmlr-group / CoPA
View on GitHub
[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"
☆11Nov 15, 2024Updated last year