HFAiLab / hfai-models
HFAI deep learning models
☆148 · Updated last year
Alternatives and similar repositories for hfai-models:
Users interested in hfai-models are comparing it to the libraries listed below.
- A flexible and efficient training framework for large-scale alignment tasks ☆333 · Updated last month
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆130 · Updated 9 months ago
- ☆78 · Updated last year
- A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute ☆611 · Updated last year
- Mixture-of-Experts (MoE) Language Model ☆185 · Updated 6 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training ☆180 · Updated last month
- FireFlyer Record file format, writer and reader for DL training samples ☆206 · Updated 2 years ago
- ☆143 · Updated 2 weeks ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation ☆255 · Updated 2 months ago
- ☆214 · Updated last year
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference ☆463 · Updated 2 weeks ago
- A Telegram bot to recommend arXiv papers ☆261 · Updated last month
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training ☆398 · Updated last week
- PyTorch bindings for CUTLASS grouped GEMM ☆111 · Updated 3 months ago
- 🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention" ☆601 · Updated 2 weeks ago
- ☆105 · Updated 4 months ago
- RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and DeepSeek R1-Zero reproduction ☆50 · Updated last month
- Tests of different distributed-training methods on High-Flyer AIHPC ☆24 · Updated 2 years ago
- Distributed RL System for LLM Reasoning ☆201 · Updated 3 weeks ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training ☆267 · Updated 2 years ago
- FlagScale is a large-model toolkit built on open-source projects ☆257 · Updated last week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning ☆161 · Updated 2 weeks ago
- ☆128 · Updated 3 weeks ago
- ☆29 · Updated 7 months ago
- Ring attention implementation with flash attention ☆721 · Updated last month
- [ICLR 2025] COAT: Compressing Optimizer States and Activations for Memory-Efficient FP8 Training ☆170 · Updated last week
- InternEvo is an open-source lightweight training framework that aims to support model pre-training without extensive dependencies ☆370 · Updated last week
- The RedStone repository includes code for preparing extensive datasets used in training large language models ☆125 · Updated last month
- Efficient AI Inference & Serving ☆469 · Updated last year
- DeepSeek Native Sparse Attention PyTorch implementation ☆54 · Updated last month