0xZee / DeepSeek-R1-FineTuning

Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation
11Updated 3 months ago

Alternatives and similar repositories for DeepSeek-R1-FineTuning

Users that are interested in DeepSeek-R1-FineTuning are comparing it to the libraries listed below

Sorting: