Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting irrelevant tokens from its vocabulary. This repository contains a python-library vocabtrimmer, that remove irrelevant tokens from a multilingual LM vocabulary for the target language.
☆63Oct 25, 2024Updated last year
Alternatives and similar repositories for lm-vocab-trimmer
Users that are interested in lm-vocab-trimmer are comparing it to the libraries listed below
Sorting: