HFAiLab / hfai-models
HFAI deep learning models
☆147Updated last year
Alternatives and similar repositories for hfai-models:
Users who are interested in hfai-models are comparing it to the libraries listed below
- A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute☆634Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆131Updated 10 months ago
- FireFlyer Record file format, writer and reader for DL training samples.☆214Updated 2 years ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆187Updated 2 months ago
- ☆78Updated last year
- ☆214Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆342Updated 2 months ago
- VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework☆304Updated 2 weeks ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆287Updated this week
- ☆179Updated last week
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference☆477Updated this week
- ☆318Updated 9 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆242Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆268Updated this week
- An industrial extension library for PyTorch to accelerate large-scale model training☆32Updated 2 months ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆402Updated this week
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆49Updated 5 months ago
- Mixture-of-Experts (MoE) Language Model☆186Updated 7 months ago
- Best practice for training LLaMA models in Megatron-LM☆649Updated last year
- ☆107Updated 5 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆173Updated last month
- Materials for learning SGLang☆387Updated last month
- InternEvo is an open-sourced lightweight training framework that aims to support model pre-training without the need for extensive dependencie…☆382Updated this week
- ☆41Updated 5 months ago
- The test of different distributed-training methods on High-Flyer AIHPC☆24Updated 2 years ago
- 🥇 A curated list of awesome large language models in finance (FinLLMs), including papers, models, datasets and codebases. A list of large financial models, especially Chinese-English bilingual large models…☆40Updated 3 months ago
- Ring attention implementation with flash attention☆743Updated 2 weeks ago
- RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and DeepSeek R1-Zero reproduction.☆55Updated 2 months ago
- ☆29Updated 7 months ago
- An integrated user interface for use with HAI Platform☆49Updated last year