sangmichaelxie / doremi

PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
306 · Updated 10 months ago

Related projects

Alternatives and complementary repositories for doremi