DolbyUUU / DeepEnlightenLinks

Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
38Updated 3 months ago

Alternatives and similar repositories for DeepEnlighten

Users that are interested in DeepEnlighten are comparing it to the libraries listed below

Sorting: