mli / transformers-benchmarksLinks

real Transformer TeraFLOPS on various GPUs

☆911

Alternatives and similar repositories for transformers-benchmarks

Users that are interested in transformers-benchmarks are comparing it to the libraries listed below

Sorting:

godweiyang / NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
☆1,488Updated 4 years ago
THUDM / SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
☆1,086Updated 7 months ago
jia-zhuang / pytorch-multi-gpu-training
整理 pytorch 单机多 GPU 训练方法与原理
☆842Updated 3 years ago
OpenRL-Lab / Wandb_Tutorial
How to use wandb?
☆668Updated last year
laekov / fastmoe
A fast MoE impl for PyTorch
☆1,766Updated 5 months ago
ZhuiyiTechnology / roformer
Rotary Transformer
☆996Updated 3 years ago
chunhuizhang / pytorch_distribute_tutorials
pytorch distribute tutorials
☆143Updated last month
Oneflow-Inc / libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
☆409Updated last week
firechecking / CleanTransformer
an implementation of transformer, bert, gpt, and diffusion models for learning purposes
☆155Updated 9 months ago
bojone / papers.cool
Cool Papers - Immersive Paper Discovery
☆584Updated last month
tczhangzhi / pytorch-distributed
A quickstart and benchmark for pytorch distributed training.
☆1,668Updated last year
Ascend / pytorch
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
☆395Updated this week
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆659Updated last year
mlc-ai / mlc-zh
☆611Updated last year
OpenBMB / BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
☆604Updated 2 months ago
DeepVAC / deepvac
PyTorch Project Specification.
☆680Updated 3 years ago
bojone / bytepiece
更纯粹、更高压缩率的Tokenizer
☆481Updated 8 months ago
miracleyoo / pytorch-lightning-template
An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板，简单易用，稍改原来Pytorch代码，即可适配Lightning。You can translate your previous Pytorch code much…
☆1,489Updated last year
git-cloner / aliendao
huggingface mirror download
☆584Updated 3 months ago
deepspeedai / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆2,123Updated 2 weeks ago
Tencent / TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
☆1,078Updated 11 months ago
FelixFu520 / README
A pupil in the computer world.(Felix Fu)
☆241Updated last year
cnstark / gputasker
An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star
☆188Updated 2 years ago
alibaba / Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
☆1,258Updated 3 weeks ago
MrYxJ / calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer…
☆836Updated last year
sxontheway / Keep-Learning
The record of what I‘ve been through.
☆100Updated 6 months ago
open-mmlab / mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
☆1,350Updated last month
thunlp / OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
☆1,032Updated 10 months ago
microsoft / Tutel
Tutel MoE: Optimized Mixture-of-Experts Library, Support DeepSeek/Kimi-K2/Qwen3 FP8/FP4
☆870Updated last week
OpenBMB / BMInf
Efficient Inference for Big Models
☆585Updated 2 years ago