Haiyang-W / TokenFormer

Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
211 · Updated this week

Related projects

Alternatives and complementary repositories for TokenFormer