WeiboAI/VibeThinker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WeiboAI/VibeThinker)

WeiboAI / VibeThinker

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

☆1,487

Alternatives and similar repositories for VibeThinker

Users that are interested in VibeThinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

z-lab / dflash
View on GitHub
DFlash: Block Diffusion for Flash Speculative Decoding
☆5,500May 10, 2026Updated 2 months ago
unslothai / unsloth
View on GitHub
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,541Updated this week
PrismML-Eng / Bonsai-demo
View on GitHub
Bonsai Demo
☆1,888Updated this week
deepseek-ai / DeepSpec
View on GitHub
DeepSpec: a full-stack codebase for training and evaluating speculative decoding algorithms
☆6,702Jul 9, 2026Updated last week
SamsungSAILMontreal / TinyRecursiveModels
View on GitHub
☆6,573Apr 1, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rlresearch / dr-tulu
View on GitHub
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
☆687Jun 17, 2026Updated last month
RyanCodrai / turbovec
View on GitHub
A vector index built on TurboQuant, written in Rust with Python bindings
☆13,639Updated this week
sapientinc / HRM
View on GitHub
Hierarchical Reasoning Model Official Release
☆12,598Mar 31, 2026Updated 3 months ago
alexzhang13 / rlm
View on GitHub
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
☆5,272Jun 26, 2026Updated 3 weeks ago
MoonshotAI / Kimi-Linear
View on GitHub
☆1,466Nov 17, 2025Updated 8 months ago
microsoft / BitNet
View on GitHub
Official inference framework for 1-bit LLMs
☆39,765Updated this week
aiming-lab / Agent0
View on GitHub
[COLM'26 & ICML'26] Agent0 Series: Self-Evolving Agents from Zero Data
☆1,233Jul 10, 2026Updated last week
OpenPipe / ART
View on GitHub
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…
☆10,496Updated this week
sapientinc / HRM-Text
View on GitHub
HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.
☆1,709Jun 17, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
QwenLM / Qwen-AgentWorld
View on GitHub
Qwen-AgentWorld: Language World Models for General Agents
☆853Updated this week
stepfun-ai / Step-Audio-EditX
View on GitHub
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…
☆951Apr 9, 2026Updated 3 months ago
Liquid4All / cookbook
View on GitHub
Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK
☆2,124Updated this week
facebookresearch / HyperAgents
View on GitHub
Self-referential self-improving agents that can optimize for any computable task
☆2,644May 9, 2026Updated 2 months ago
microsoft / rStar
View on GitHub
☆1,422Sep 12, 2025Updated 10 months ago
MBZUAI-IFM / K2-Think-SFT
View on GitHub
☆131Sep 9, 2025Updated 10 months ago
SakanaAI / sparser-faster-llms
View on GitHub
Cuda kernels for leveraging LLM sparsity to improve throughput and decrease the memory requirements during inference and training.
☆253Jun 29, 2026Updated 3 weeks ago
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,708Updated this week
SakanaAI / text-to-lora
View on GitHub
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆1,294Jun 8, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
JustVugg / colibri
View on GitHub
Run GLM-5.2 (744B MoE) on a 25GB-RAM consumer machine — pure C, zero deps, experts streamed from disk. Tiny engine, immense model. 🐦
☆16,775Updated this week
MiniMax-AI / MiniMax-M2
View on GitHub
MiniMax-M2, a model built for Max coding & agentic workflows.
☆2,601Nov 13, 2025Updated 8 months ago
aaif-goose / goose
View on GitHub
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
☆51,333Updated this week
test-time-training / e2e
View on GitHub
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆625Feb 15, 2026Updated 5 months ago
huggingface / OpenEnv
View on GitHub
An interface library for RL post training with environments.
☆2,436Updated this week
NVlabs / Sana
View on GitHub
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
☆8,498Updated this week
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,691Feb 27, 2026Updated 4 months ago
deepseek-ai / Engram
View on GitHub
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
☆4,534Jan 14, 2026Updated 6 months ago
facebookresearch / llm_souping
View on GitHub
Model souping for LLMs
☆73Nov 18, 2025Updated 8 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
QwenLM / qwen-code
View on GitHub
An open-source AI coding agent that lives in your terminal.
☆26,159Updated this week
ZHZisZZ / dllm
View on GitHub
dLLM: Simple Diffusion Language Modeling
☆2,651Updated this week
StarTrail-org / LEANN
View on GitHub
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on …
☆12,711Updated this week
RecursiveMAS / RecursiveMAS
View on GitHub
Offical Implementation for "Recursive Multi-Agent Systems"
☆893Jun 29, 2026Updated 3 weeks ago
supertone-inc / supertonic
View on GitHub
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆13,447Jun 30, 2026Updated 3 weeks ago
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆86,727Updated this week
apple / ml-ssd
View on GitHub
☆796Apr 16, 2026Updated 3 months ago