DolbyUUU / Logic-RL-Lite

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
81 · Updated 2 weeks ago

Alternatives and similar repositories for Logic-RL-Lite:

Users who are interested in Logic-RL-Lite are comparing it to the libraries listed below.