zhixuan-lin / forgetting-transformerView on GitHub
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
141Feb 25, 2026Updated last week

Alternatives and similar repositories for forgetting-transformer

Users that are interested in forgetting-transformer are comparing it to the libraries listed below

Sorting:

Are these results useful?