Gryphe / BlockMerge_Gradient

Merge Transformers language models by use of gradient parameters.
203Updated 5 months ago

Alternatives and similar repositories for BlockMerge_Gradient:

Users that are interested in BlockMerge_Gradient are comparing it to the libraries listed below