FlagAI-Open / OpenSeekLinks

OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next-generation models that surpass DeepSeek.

☆217

Alternatives and similar repositories for OpenSeek

Users that are interested in OpenSeek are comparing it to the libraries listed below

Sorting:

RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆200Updated last week
JT-Ushio / MHA2MLA
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
☆183Updated last month
SuperGPQA / SuperGPQA
☆157Updated 3 months ago
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆417Updated 2 months ago
modelscope / Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…
☆204Updated this week
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆255Updated 3 weeks ago
step-law / steplaw
☆198Updated 3 months ago
HarderThenHarder / RLLoggingBoard
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
☆238Updated 5 months ago
a-m-team / a-m-models
a-m-team's exploration in large language modeling
☆178Updated 2 months ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆336Updated 7 months ago
alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆395Updated this week
inclusionAI / Ling
Ling is a MoE LLM provided and open-sourced by InclusionAI.
☆181Updated 2 months ago
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆189Updated 10 months ago
boson-ai / RPBench-Auto
An automated pipeline for evaluating LLMs for role-playing.
☆192Updated 10 months ago
Tongyi-Zhiwen / QwenLong-L1
☆287Updated 2 months ago
InternLM / InternBootcamp
☆173Updated last month
SkyworkAI / skywork-o1-prm-inference
☆65Updated 8 months ago
ByteDance-Seed / Seed-Thinking-v1.5
☆800Updated last month
Qihoo360 / Light-R1
☆733Updated 2 months ago
LCLM-Horizon / A-Comprehensive-Survey-For-Long-Context-Language-Modeling
A Comprehensive Survey on Long Context Language Modeling
☆169Updated 3 weeks ago
Wangmerlyn / MCTS-GSM8k-Demo
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆85Updated 4 months ago
fxmeng / TransMLA
TransMLA: Multi-Head Latent Attention Is All You Need
☆335Updated 3 weeks ago
ADaM-BJTU / OpenRFT
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
☆146Updated 7 months ago
InternLM / OREAL
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆188Updated 4 months ago
llm-factory / LLaMA-Factory-Doc
LLaMA Factory Document
☆146Updated 2 weeks ago
QwenLM / AutoIF
☆298Updated last year
jshuadvd / LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆146Updated last year
microsoft / RedStone
The RedStone repository includes code for preparing extensive datasets used in training large language models.
☆136Updated last month
RyanLiu112 / compute-optimal-tts
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
☆268Updated 5 months ago
qiancheng0 / ToolRL
☆300Updated last month