zhixuan-lin / forgetting-transformer
View external linksLinks

[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
137Dec 19, 2025Updated last month

Alternatives and similar repositories for forgetting-transformer

Users that are interested in forgetting-transformer are comparing it to the libraries listed below

Sorting:

Are these results useful?