MingLiiii / Layer_Gradient

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
59Updated 3 months ago

Alternatives and similar repositories for Layer_Gradient:

Users that are interested in Layer_Gradient are comparing it to the libraries listed below