QwenLM / ParScaleLinks

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

☆428

Alternatives and similar repositories for ParScale

Users that are interested in ParScale are comparing it to the libraries listed below

Sorting:

JT-Ushio / MHA2MLA
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
☆183Updated last month
ChenxinAn-fdu / POLARIS
Scaling RL on advanced reasoning models
☆543Updated last week
ZihanWang314 / CoE
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
☆219Updated last month
RyanLiu112 / compute-optimal-tts
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
☆268Updated 5 months ago
Tongyi-Zhiwen / QwenLong-L1
☆288Updated 2 months ago
RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆207Updated 2 weeks ago
step-law / steplaw
☆198Updated 3 months ago
sail-sg / oat-zero
A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
☆245Updated 3 months ago
Gen-Verse / ReasonFlux
ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling
☆470Updated 2 weeks ago
ByteDance-Seed / Seed-Thinking-v1.5
☆802Updated 2 months ago
SuperGPQA / SuperGPQA
☆157Updated 3 months ago
hao-ai-lab / Dynasor
Simple extension on vLLM to help you speed up reasoning model without training.
☆172Updated 2 months ago
Tencent / llm.hunyuan.T1
☆77Updated 4 months ago
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆274Updated 2 months ago
InternLM / OREAL
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆188Updated 4 months ago
fxmeng / TransMLA
TransMLA: Multi-Head Latent Attention Is All You Need
☆335Updated 3 weeks ago
eddycmu / demystify-long-cot
☆309Updated 2 months ago
HKUNLP / ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆431Updated 9 months ago
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆265Updated 2 weeks ago
cmu-l3 / l1
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
☆234Updated 2 months ago
MiniMax-AI / SynLogic
The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆161Updated last month
ypwang61 / One-Shot-RLVR
official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”
☆337Updated last week
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆344Updated 7 months ago
LCLM-Horizon / A-Comprehensive-Survey-For-Long-Context-Language-Modeling
A Comprehensive Survey on Long Context Language Modeling
☆169Updated last month
microsoft / LongRoPE
LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.
☆240Updated 11 months ago
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆255Updated last month
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆344Updated 3 weeks ago
zwhe99 / DeepMath
A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
☆243Updated 2 months ago
knoveleng / open-rs
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
☆248Updated 2 months ago
thunlp / InfLLM
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…
☆374Updated last year