LeslieTrue / SFTvsRL

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
269Updated last week

Alternatives and similar repositories for SFTvsRL:

Users that are interested in SFTvsRL are comparing it to the libraries listed below