pangu-tech / pangu-ultra
☆57 Updated 3 weeks ago
Alternatives and similar repositories for pangu-ultra
Users who are interested in pangu-ultra are comparing it to the repositories listed below
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆133 Updated last year
- Repo for "Z1: Efficient Test-time Scaling with Code" ☆61 Updated 2 months ago
- ☆104 Updated 3 weeks ago
- ☆53 Updated last week
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme ☆131 Updated 2 months ago
- ☆36 Updated last week
- ☆58 Updated last week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning ☆186 Updated 3 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation ☆105 Updated last month
- ☆77 Updated 2 months ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm… ☆32 Updated last month
- Simple extension on vLLM to help you speed up reasoning models without training. ☆161 Updated 3 weeks ago
- General Reasoner: Advancing LLM Reasoning Across All Domains ☆142 Updated 2 weeks ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆37 Updated 4 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning ☆275 Updated 3 weeks ago
- Linear Attention Sequence Parallelism (LASP) ☆84 Updated last year
- ☆86 Updated last month
- qwen-nsa ☆67 Updated 2 months ago
- ☆80 Updated 5 months ago
- [ACL 2025] An official PyTorch implementation of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement ☆30 Updated last month
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs ☆176 Updated last week
- Accelerate LLM preference tuning via prefix sharing with a single line of code ☆41 Updated last month
- ☆112 Updated this week
- FuseAI Project ☆87 Updated 5 months ago
- A High-Efficiency System of Large Language Model Based Search Agents ☆56 Updated 3 weeks ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆32 Updated last year
- An Open Math Pre-training Dataset with 370B Tokens. ☆89 Updated 2 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding" ☆233 Updated 2 weeks ago
- Nano repo for RL training of LLMs ☆62 Updated 2 weeks ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning ☆52 Updated 3 weeks ago