kaiokendev / cutoff-len-is-context-len

Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
63Updated last year

Alternatives and similar repositories for cutoff-len-is-context-len:

Users that are interested in cutoff-len-is-context-len are comparing it to the libraries listed below