Haiyang-W / TokenFormer

[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆559 Updated 3 months ago

Alternatives and similar repositories for TokenFormer

Users who are interested in TokenFormer are comparing it to the libraries listed below.
