DolbyUUU / DeepEnlightenLinks

Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
37Updated 2 months ago

Alternatives and similar repositories for DeepEnlighten

Users that are interested in DeepEnlighten are comparing it to the libraries listed below

Sorting: