zhixuan-lin / forgetting-transformerLinks

[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
124Updated last week

Alternatives and similar repositories for forgetting-transformer

Users that are interested in forgetting-transformer are comparing it to the libraries listed below

Sorting: