0xZee / DeepSeek-R1-FineTuning

Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation
11 · Updated last month

Alternatives and similar repositories for DeepSeek-R1-FineTuning:

Users who are interested in DeepSeek-R1-FineTuning are comparing it to the libraries listed below.