DolbyUUU / DeepEnlighten

Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
27Updated last week

Alternatives and similar repositories for DeepEnlighten:

Users that are interested in DeepEnlighten are comparing it to the libraries listed below