Ling is a MoE LLM provided and open-sourced by InclusionAI.
☆241 · May 14, 2025 · Updated 10 months ago
Alternatives and similar repositories for Ling
Users that are interested in Ling are comparing it to the libraries listed below.
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling. ☆107 · Aug 5, 2025 · Updated 7 months ago
- A high-performance kernel library for LLM training ☆67 · Feb 11, 2026 · Updated last month
- The OlymMATH dataset ☆24 · Jun 1, 2025 · Updated 9 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible. ☆4,855 · Updated this week
- Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI. ☆265 · Oct 4, 2025 · Updated 5 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease ☆1,145 · Mar 19, 2026 · Updated last week
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629 ☆21 · Oct 14, 2025 · Updated 5 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling ☆476 · May 17, 2025 · Updated 10 months ago
- Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM. ☆647 · Mar 17, 2026 · Updated last week
- ☆19 · Aug 4, 2025 · Updated 7 months ago
- An industrial extension library of pytorch to accelerate large scale model training ☆59 · Aug 13, 2025 · Updated 7 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆743 · Jun 6, 2025 · Updated 9 months ago
- DLRover: An Automatic Distributed Deep Learning System ☆1,641 · Mar 16, 2026 · Updated last week
- ☆1,113 · Jan 10, 2026 · Updated 2 months ago
- Fork of Flame repo for training of some new stuff in development ☆19 · Mar 17, 2026 · Updated last week
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect… ☆134 · Jan 31, 2026 · Updated last month
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) ☆65 · Feb 19, 2026 · Updated last month
- ☆214 · Dec 19, 2025 · Updated 3 months ago
- The official implementation of ReCap: Better Gaussian Relighting With Cross-Environment Captures ☆18 · Jun 15, 2025 · Updated 9 months ago
- ☆75 · May 30, 2025 · Updated 9 months ago
- ☆51 · Mar 9, 2026 · Updated 2 weeks ago
- Manages vllm-nccl dependency ☆17 · Jun 3, 2024 · Updated last year
- Code and data for paper "(How) do Language Models Track State?" ☆22 · Mar 31, 2025 · Updated 11 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration" ☆139 · Sep 4, 2025 · Updated 6 months ago
- ☆63 · Jun 12, 2025 · Updated 9 months ago
- The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration" ☆24 · Feb 4, 2026 · Updated last month
- ☆18 · Mar 3, 2025 · Updated last year
- ☆19 · Sep 19, 2024 · Updated last year
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆21 · Jan 8, 2025 · Updated last year
- ☆68 · Mar 21, 2025 · Updated last year
- ☆84 · Apr 3, 2025 · Updated 11 months ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training ☆262 · Aug 9, 2025 · Updated 7 months ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 8 months ago
- Research work aimed at addressing the problem of modeling infinite-length context ☆48 · Dec 18, 2025 · Updated 3 months ago
- Collection of papers for scalable automated alignment. ☆93 · Oct 22, 2024 · Updated last year
- The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models" ☆114 · Aug 15, 2025 · Updated 7 months ago
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient ☆66 · Aug 3, 2025 · Updated 7 months ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation ☆98 · Mar 5, 2026 · Updated 3 weeks ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆50 · Feb 4, 2026 · Updated last month