mli / transformers-benchmarks
real Transformer TeraFLOPS on various GPUs
☆911Updated last year
Alternatives and similar repositories for transformers-benchmarks
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,488Updated 4 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,086Updated 7 months ago
- A summary of the methods and principles of single-machine multi-GPU training in PyTorch☆842Updated 3 years ago
- How to use wandb?☆668Updated last year
- A fast MoE impl for PyTorch☆1,766Updated 5 months ago
- Rotary Transformer☆996Updated 3 years ago
- pytorch distribute tutorials☆143Updated last month
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆409Updated last week
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆155Updated 9 months ago
- Cool Papers - Immersive Paper Discovery☆584Updated last month
- A quickstart and benchmark for pytorch distributed training.☆1,668Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆395Updated this week
- Best practice for training LLaMA models in Megatron-LM☆659Updated last year
- ☆611Updated last year
- Efficient Training (including pre-training and fine-tuning) for Big Models☆604Updated 2 months ago
- PyTorch Project Specification.☆680Updated 3 years ago
- A purer Tokenizer with a higher compression rate☆481Updated 8 months ago
- An easy/swift-to-adapt PyTorch-Lightning template. A wrapper template that is simple and easy to use: with minor changes to your original PyTorch code, it adapts to Lightning. You can translate your previous Pytorch code much…☆1,489Updated last year
- huggingface mirror download☆584Updated 3 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆2,123Updated 2 weeks ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,078Updated 11 months ago
- A pupil in the computer world. (Felix Fu)☆241Updated last year
- An awesome GPU task scheduler. A lightweight, easy-to-use task scheduling tool for GPU clusters. If you find it useful, feel free to give it a star.☆188Updated 2 years ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,258Updated 3 weeks ago
- The calflops is designed to calculate FLOPs, MACs and Parameters in all various neural networks, such as Linear, CNN, RNN, GCN, Transformer…☆836Updated last year
- The record of what I've been through.☆100Updated 6 months ago
- OpenMMLab Foundational Library for Training Deep Learning Models☆1,350Updated last month
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,032Updated 10 months ago
- Tutel MoE: Optimized Mixture-of-Experts Library, Support DeepSeek/Kimi-K2/Qwen3 FP8/FP4☆870Updated last week
- Efficient Inference for Big Models☆585Updated 2 years ago