J-Rosser-UK / Torch2Jax-DeepSeek-R1-Distill-Qwen-1.5B

Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.
16Updated last month

Alternatives and similar repositories for Torch2Jax-DeepSeek-R1-Distill-Qwen-1.5B:

Users that are interested in Torch2Jax-DeepSeek-R1-Distill-Qwen-1.5B are comparing it to the libraries listed below