Deepseek R1 zero tiny version own reproduce on two A100s.
☆84Feb 1, 2025Updated last year
Alternatives and similar repositories for TinyZero
Users that are interested in TinyZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".☆14Apr 6, 2022Updated 4 years ago
- PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model☆28Oct 10, 2024Updated last year
- Minimal reproduction of DeepSeek R1-Zero☆13,014Feb 27, 2026Updated last month
- ☆37Feb 4, 2026Updated 2 months ago
- ☆10Feb 2, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆45Mar 27, 2026Updated 2 weeks ago
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 9 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 7 months ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆31Jan 27, 2026Updated 2 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆60Feb 6, 2026Updated 2 months ago
- ☆25May 30, 2023Updated 2 years ago
- Task dependent skill transformation is challenging due to the ignorance of the relationships between primitive skills. In this project, w…