HFAiLab / hfai-models
HFAI deep learning models
☆90Updated last year
Related projects ⓘ
Alternatives and complementary repositories for hfai-models
- FireFlyer Record file format, writer and reader for DL training samples.☆116Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆209Updated this week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆123Updated this week
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆57Updated last year
- ATC23 AE☆43Updated last year
- ☆74Updated 11 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆126Updated 5 months ago
- The related works and background techniques about Openai o1☆144Updated 2 weeks ago
- ☆89Updated 7 months ago
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference☆362Updated this week
- Odysseus: Playground of LLM Sequence Parallelism☆57Updated 5 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆104Updated 11 months ago
- ☆64Updated 4 months ago
- An automated pipeline for evaluating LLMs for role-playing.☆137Updated 2 months ago
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to…☆49Updated last year
- Low-bit optimizers for PyTorch☆119Updated last year
- PyTorch bindings for CUTLASS grouped GEMM.☆68Updated 4 months ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆41Updated 4 months ago
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆311Updated last year
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆151Updated 11 months ago
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆274Updated last year
- Official implementation of TransNormerLLM: A Faster and Better LLM☆229Updated 9 months ago
- Implementation of FlashAttention in PyTorch☆123Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆171Updated this week
- Triton implementation of Flash Attention2.0☆22Updated last year
- ☆28Updated 2 months ago
- A unified tokenization tool for Images, Chinese and English.☆150Updated last year
- Feeling confused about super alignment? Here is a reading list☆43Updated 10 months ago
- The test of different distributed-training methods on High-Flyer AIHPC☆21Updated 2 years ago
- veRL: Volcano Engine Reinforcement Learning for LLM☆327Updated this week