WeiboAI / VibeThinkerLinks
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
☆155Updated this week
Alternatives and similar repositories for VibeThinker
Users that are interested in VibeThinker are comparing it to the libraries listed below
Sorting:
- Pivotal Token Search☆131Updated 4 months ago
- ☆102Updated last year
- ☆49Updated 9 months ago
- CursorCore: Assist Programming through Aligning Anything☆132Updated 9 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF☆53Updated last month
- ☆177Updated last week
- GRadient-INformed MoE☆264Updated last year
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- Kyutai with an "eye"☆223Updated 7 months ago
- ☆62Updated 4 months ago
- ☆28Updated 3 months ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆207Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆98Updated 5 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆102Updated 10 months ago
- ☆57Updated 9 months ago
- ☆55Updated 11 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆478Updated last week
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 8 months ago
- Sparse Inferencing for transformer based LLMs☆201Updated 3 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated 3 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆68Updated 6 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆95Updated 6 months ago
- ☆87Updated 5 months ago
- Verification of Google DeepMind's AlphaEvolve 48-multiplication matrix algorithm, a breakthrough in matrix multiplication after 56 years.☆125Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- Very minimal (and stateless) agent framework☆45Updated 10 months ago
- ☆77Updated last week
- ☆158Updated 6 months ago