xichen-fy / Fira

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
81 · Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for Fira