WeiboAI / VibeThinkerLinks
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
☆560Updated last month
Alternatives and similar repositories for VibeThinker
Users that are interested in VibeThinker are comparing it to the libraries listed below
Sorting:
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆550Updated 2 months ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆550Updated last week
- ☆1,257Updated last month
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆527Updated 3 weeks ago
- ☆301Updated 5 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆547Updated last week
- ☆431Updated last month
- Sparse Inferencing for transformer based LLMs☆217Updated 5 months ago
- LongCodeZip: Compress Long Context for Code Language Models [ASE2025]☆134Updated last month
- Kyutai with an "eye"☆233Updated 9 months ago
- ☆508Updated 3 weeks ago
- ☆720Updated last month
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆356Updated last week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆795Updated 3 weeks ago
- ☆856Updated 4 months ago
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆810Updated 6 months ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research☆510Updated this week
- ☆158Updated 8 months ago
- OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.☆621Updated 2 months ago
- ☆128Updated 4 months ago
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.☆399Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆357Updated 6 months ago
- A command-line interface tool for serving LLM using vLLM.☆461Updated last month
- [DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing☆197Updated last month
- An open-source implementation of Whisper☆472Updated 2 months ago
- Official implementation of "Continuous Autoregressive Language Models"☆686Updated last month
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆250Updated last month
- OpenCUA: Open Foundations for Computer-Use Agents☆633Updated this week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆459Updated 4 months ago
- GRadient-INformed MoE☆264Updated last year