jzhou316 / Post-DeepSeek-R1_LLM-RLLinks

Learning and research after DeepSeek-R1, around test-time computing, resurgence of RL, and new LLM learning/application paradigms.
14Updated last week

Alternatives and similar repositories for Post-DeepSeek-R1_LLM-RL

Users that are interested in Post-DeepSeek-R1_LLM-RL are comparing it to the libraries listed below

Sorting: