Haiyang-W / TokenFormer

Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
β˜†335Updated last week

Related projects β“˜

Alternatives and complementary repositories for TokenFormer