KaihuaTang / Qwen-Tokenizer-Pruner

Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this project provides a Tokenizer vocabulary shearing solution for Qwen and Qwen-VL.
12Updated 6 months ago

Alternatives and similar repositories for Qwen-Tokenizer-Pruner:

Users that are interested in Qwen-Tokenizer-Pruner are comparing it to the libraries listed below