MBZUAI-IFM / K2-Think-SFTLinks
☆127Updated 4 months ago
Alternatives and similar repositories for K2-Think-SFT
Users that are interested in K2-Think-SFT are comparing it to the libraries listed below
Sorting:
- All information and news with respect to Falcon-H1 series☆95Updated 3 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆250Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 4 months ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆480Updated last week
- Code for Bolmo: Byteifying the Next Generation of Language Models☆112Updated 2 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆355Updated 6 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 4 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B