EleutherAI / features-across-time

Understanding how features learned by neural networks evolve throughout training

☆39

Alternatives and similar repositories for features-across-time

Users who are interested in features-across-time are comparing it to the libraries listed below.

EleutherAI / improved-t5
Experiments for efforts to train a new and improved T5
☆76 Updated last year
KaiNylund / lm-weights-encode-time
☆69 Updated last year
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆20 Updated 10 months ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆64 Updated last year
ekinakyurek / google-research
Google Research
☆46 Updated 3 years ago
srush / LLM-Talk
☆52 Updated last year
UKPLab / on-emergence
Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?"
☆33 Updated 10 months ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32 Updated last year
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59 Updated 2 years ago
EleutherAI / training-jacobian
☆24 Updated 11 months ago
ExtensityAI / benchmark
Evaluation of neuro-symbolic engines
☆40 Updated last year
nathanhu0 / CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25 Updated last year
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆34 Updated last year
mcleish7 / gemstone-scaling-laws
Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
☆30 Updated 2 months ago
wesg52 / universal-neurons
Universal Neurons in GPT2 Language Models
☆31 Updated last year
ltgoslo / bert-in-context
Official implementation of "BERTs are Generative In-Context Learners"
☆32 Updated 8 months ago
justinlovelace / Diffusion-Guided-LM
☆29 Updated last month
jxiw / BiGS
Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …
☆115 Updated last year
Aleph-Alpha-Research / trigrams
☆58 Updated 2 weeks ago
RobertCsordas / moeut
☆89 Updated last year
KihoPark / LLM_Categorical_Hierarchical_Representations
☆111 Updated 9 months ago
nrimsky / InfluenceFunctions
Implementation of Influence Function approximations for differently sized ML models, using PyTorch
☆15 Updated 2 years ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆60 Updated 2 months ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38 Updated 2 years ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆31 Updated 10 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆91 Updated last year
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆130 Updated 3 years ago
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆40 Updated 2 years ago
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆95 Updated 2 months ago
ClashLuke / tpucare
Automatically take good care of your preemptible TPUs
☆37 Updated 2 years ago