OSU-NLP-Group / GrokkedTransformer

Code for the NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
161 · Updated last month

Related projects

Alternatives and complementary repositories for GrokkedTransformer