pangu-tech / pangu-ultraLinks

☆72

Alternatives and similar repositories for pangu-ultra

Users that are interested in pangu-ultra are comparing it to the libraries listed below

Sorting:

Tencent / llm.hunyuan.T1
☆84Updated 6 months ago
SkyworkAI / Skywork-MoE
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
☆137Updated last year
MiroMindAI / MiroMind-M1
MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.
☆239Updated 2 months ago
bigai-nlco / TokenSwift
[ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
☆116Updated 5 months ago
yyht / openrlhf_async_pipline
☆83Updated 2 months ago
Tongyi-Zhiwen / QwenLong-L1
☆299Updated 5 months ago
MoonshotAI / Kimi-Researcher
☆73Updated 4 months ago
inclusionAI / Ling-V2
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI.
☆191Updated 3 weeks ago
JT-Ushio / MHA2MLA
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
☆191Updated 3 weeks ago
inclusionAI / Ring
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.
☆105Updated 2 months ago
MiniMax-AI / SynLogic
[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆176Updated 3 months ago
THUDM / DeepDive
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
☆190Updated last month
Infini-AI-Lab / Multiverse
☆100Updated last month
RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆220Updated 3 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
MiroMindAI / MiroRL
MiroRL is an MCP-first reinforcement learning framework for deep research agent.
☆169Updated 2 months ago
InternLM / OREAL
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆190Updated 7 months ago
ZihanWang314 / CoE
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
☆222Updated last month
efficientscaling / Z1
[EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"
☆66Updated 6 months ago
GAIR-NLP / PC-Agent-E
Efficient Agent Training for Computer Use
☆132Updated last month
Multiverse4FM / Multiverse
☆81Updated 4 months ago
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆449Updated 5 months ago
GAIR-NLP / MAYE
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
☆142Updated 6 months ago
plm-team / PLM
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
☆20Updated 7 months ago
hao-ai-lab / Dynasor
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
☆198Updated 5 months ago
MiniMax-AI / One-RL-to-See-Them-All
The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning
☆320Updated 5 months ago
AQ-MedAI / MrlX
MrlX: A Multi-Agent Reinforcement Learning Framework
☆116Updated this week
SkyworkAI / skywork-o1-prm-inference
☆65Updated 11 months ago
thunlp / Delta-CoMe
Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024
☆57Updated 11 months ago
SuperGPQA / SuperGPQA
☆169Updated 6 months ago