WeiboAI / VibeThinkerLinks
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
☆527Updated 2 weeks ago
Alternatives and similar repositories for VibeThinker
Users that are interested in VibeThinker are comparing it to the libraries listed below
Sorting:
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆521Updated last month
- ☆1,226Updated 2 weeks ago
- Sparse Inferencing for transformer based LLMs☆215Updated 3 months ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆117Updated last week
- ☆301Updated 4 months ago
- Kyutai with an "eye"☆225Updated 8 months ago
- ☆158Updated 7 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆352Updated 5 months ago
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆273Updated last month
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆237Updated 2 weeks ago
- ☆127Updated 2 months ago
- Official implementation of "Continuous Autoregressive Language Models"☆646Updated this week
- GRadient-INformed MoE☆264Updated last year
- OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.☆591Updated last month
- ☆843Updated 2 months ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research☆428Updated this week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆285Updated 2 months ago
- ☆709Updated this week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆751Updated 2 months ago
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆807Updated 4 months ago
- An open-source implementation of Whisper☆466Updated last month
- [DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing☆192Updated last week
- LongCodeZip: Compress Long Context for Code Language Models [ASE2025]☆126Updated last week
- ☆413Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆100Updated 6 months ago
- The official GitHub Page for MiniMax☆60Updated 3 weeks ago
- Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"☆318Updated 3 weeks ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆166Updated 3 months ago
- LIMI: Less is More for Agency☆151Updated last month
- Agent0 Series: Self-Evolving Agents from Zero Data☆767Updated this week