kaiokendev / cutoff-len-is-context-len

Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
63Updated last year

Related projects

Alternatives and complementary repositories for cutoff-len-is-context-len