asahi417 / lm-vocab-trimmer

Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting irrelevant tokens from its vocabulary. This repository contains a python-library vocabtrimmer, that remove irrelevant tokens from a multilingual LM vocabulary for the target language.
33Updated 3 months ago

Alternatives and similar repositories for lm-vocab-trimmer:

Users that are interested in lm-vocab-trimmer are comparing it to the libraries listed below