Haiyang-W / TokenFormer

[ICLR2025 SpotlightπŸ”₯] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
β˜†514Updated last week

Alternatives and similar repositories for TokenFormer:

Users that are interested in TokenFormer are comparing it to the libraries listed below