MBZUAI-IFM / K2-Think-SFT
☆112 Updated last month
Alternatives and similar repositories for K2-Think-SFT
Users who are interested in K2-Think-SFT are comparing it to the repositories listed below
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆161 Updated last month
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆220 Updated last week
- ☆300 Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆345 Updated 3 months ago
- ☆103 Updated 3 months ago
- All information and news with respect to Falcon-H1 series ☆91 Updated last week
- ☆157 Updated 6 months ago
- Sparse Inferencing for transformer based LLMs ☆201 Updated 2 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆189 Updated 2 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training ☆45 Updated 3 months ago
- accompanying material for sleep-time compute paper ☆117 Updated 5 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆193 Updated 2 weeks ago
- LIMI: Less is More for Agency ☆138 Updated this week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated 10 months ago
- RLP: Reinforcement as a Pretraining Objective ☆182 Updated 2 weeks ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆52 Updated 2 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆101 Updated last month
- ☆62 Updated 3 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository ☆37 Updated 3 weeks ago
- Kyutai with an "eye" ☆222 Updated 6 months ago
- An open-source implementation of Whisper ☆447 Updated last week
- LongCodeZip: Compress Long Context for Code Language Models [ASE2025] ☆104 Updated this week
- SoTA Approach for ARC-AGI 2 ☆103 Updated last month
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆671 Updated 3 weeks ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆296 Updated last month
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆116 Updated 2 months ago
- Esoteric Language Models ☆101 Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆273 Updated 3 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆29 Updated 10 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆109 Updated 5 months ago