HFAiLab / hfai-models
HFAI deep learning models
☆87 Updated last year
Related projects
Alternatives and complementary repositories for hfai-models
- FireFlyer Record file format, writer and reader for DL training samples. ☆116 Updated last year
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation ☆116 Updated last month
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference ☆353 Updated last week
- The related works and background techniques about Openai o1 ☆138 Updated this week
- ATC23 AE ☆43 Updated last year
- veRL: Volcano Engine Reinforcement Learning for LLM ☆290 Updated last week
- A MoE impl for PyTorch, [ATC'23] SmartMoE ☆57 Updated last year
- ☆198 Updated this week
- ☆179 Updated 11 months ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training ☆390 Updated this week
- A unified tokenization tool for Images, Chinese and English. ☆150 Updated last year
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to… ☆48 Updated last year
- An automated pipeline for evaluating LLMs for role-playing. ☆134 Updated last month
- Official implementation of TransNormerLLM: A Faster and Better LLM ☆229 Updated 9 months ago
- ☆74 Updated 10 months ago
- A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute ☆308 Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆127 Updated 5 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, … ☆105 Updated 10 months ago
- Rectified Rotary Position Embeddings ☆339 Updated 5 months ago
- Odysseus: Playground of LLM Sequence Parallelism ☆55 Updated 4 months ago
- AI Alignment: A Comprehensive Survey ☆127 Updated last year
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a… ☆349 Updated 7 months ago
- ☆149 Updated 3 weeks ago
- mllm-npu: training multimodal large language models on Ascend NPUs ☆83 Updated 2 months ago
- Low-bit optimizers for PyTorch ☆118 Updated last year
- ☆89 Updated 7 months ago
- The official repo of INF-34B models trained by INF Technology. ☆34 Updated 3 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024) ☆295 Updated 3 weeks ago
- [ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear at… ☆98 Updated 4 months ago
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models ☆184 Updated 6 months ago