pengr / DataMan
Our code for the ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".
119 · Feb 7, 2026 · Updated last month

Alternatives and similar repositories for DataMan

Users who are interested in DataMan are comparing it to the libraries listed below.
