Qihoo360 / 360-LLaMA-FactoryLinks

adds Sequence Parallelism into LLaMA-Factory

☆538

Alternatives and similar repositories for 360-LLaMA-Factory

Users that are interested in 360-LLaMA-Factory are comparing it to the libraries listed below

Sorting:

pat-jj / DeepRetrieval
[COLM'25] DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning
☆601Updated last month
Simple-Efficient / RL-Factory
Train your Agent model via our easy and efficient framework
☆1,317Updated this week
RLHFlow / Online-DPO-R1
Codebase for Iterative DPO Using Rule-based Rewards
☆255Updated 3 months ago
KodCode-AI / kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
☆251Updated 2 weeks ago
ChenmienTan / RL2
☆600Updated this week
cmriat / l0
A scalable, end-to-end training pipeline for general-purpose agents
☆349Updated last month
Zefan-Cai / KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models
☆1,219Updated 7 months ago
uclaml / SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
☆574Updated 6 months ago
HJYao00 / Mulberry
Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
☆1,208Updated 4 months ago
HKUDS / SepLLM
[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"
☆532Updated last week
mlpod / OpenSFT
☆45Updated 4 months ago
yfzhang114 / r1_reward
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
☆246Updated 2 months ago
PKU-YuanGroup / Machine-Mindset
An MBTI Exploration of Large Language Models
☆493Updated last year
dhcode-cpp / X-R1
minimal-cost for training 0.5B R1-Zero
☆765Updated 2 months ago
URSA-MATH / URSA-MATH
☆63Updated 4 months ago
Alpha-Innovator / DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
☆142Updated 6 months ago
Ledzy / BAdam
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
☆265Updated 4 months ago
LZY-the-boys / Twin-Merging
[NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
☆136Updated 4 months ago
longyuewangdcu / Chinese-Llama-2
improve Llama-2's proficiency in comprehension, generation, and translation of Chinese.
☆448Updated last year
jordddan / Pruning-LLMs
The framework to prune LLMs to any size and any config.
☆94Updated last year
HITsz-TMG / UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
☆745Updated this week
PKU-Alignment / align-anything
Align Anything: Training All-modality Model with Feedback
☆4,402Updated 2 months ago
Zefan-Cai / R-KV
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
☆1,097Updated last month
xinghaow99 / BitStack
[ICLR 2025] BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
☆36Updated 5 months ago
Alpha-Innovator / SurveyForge
(ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…
☆277Updated last month
gersteinlab / ML-Bench
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…
☆302Updated this week
bird-bench / BIRD-CRITIC-1
BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?
☆766Updated 3 weeks ago
FanbinLu / STEVE-R1
R1-like Computer-use Agent
☆80Updated 4 months ago
luo-junyu / Awesome-Agent-Papers
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
☆1,327Updated 3 weeks ago
codefuse-ai / CodeFuse-CGM
☆359Updated last month