jrosseruk / Torch2Jax-DeepSeek-R1-Distill-Qwen-1.5BLinks

Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.
22Updated 7 months ago

Alternatives and similar repositories for Torch2Jax-DeepSeek-R1-Distill-Qwen-1.5B

Users that are interested in Torch2Jax-DeepSeek-R1-Distill-Qwen-1.5B are comparing it to the libraries listed below

Sorting: