mli / transformers-benchmarks
real Transformer TeraFLOPS on various GPUs
☆899Updated last year
Alternatives and similar repositories for transformers-benchmarks:
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,073Updated 3 months ago
- Rotary Transformer☆932Updated 3 years ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,455Updated 3 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆401Updated 3 weeks ago
- ☆607Updated 10 months ago
- Best practice for training LLaMA models in Megatron-LM☆649Updated last year
- A fast MoE impl for PyTorch☆1,699Updated 2 months ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆998Updated this week
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆152Updated 6 months ago
- PyTorch Project Specification.☆679Updated 3 years ago
- 更纯粹、更高压缩率的Tokenizer☆474Updated 4 months ago
- huggingface mirror download☆572Updated 2 weeks ago
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆328Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆2,044Updated 3 weeks ago
- Tutel MoE: Optimized Mixture-of-Experts Library, Support DeepSeek FP8/FP4☆800Updated this week
- Efficient Training (including pre-training and fine-tuning) for Big Models☆582Updated 8 months ago
- How to use wandb?☆634Updated last year
- 看图学大模型☆287Updated 8 months ago
- 整理 pytorch 单机多 GPU 训练方法与原理☆811Updated 3 years ago
- how to optimize some algorithm in cuda.☆2,090Updated last week
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆96Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆263Updated last week
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆312Updated 8 months ago
- RoFormer V1 & V2 pytorch☆492Updated 2 years ago
- Open Academic Research on Improving LLaMA to SOTA LLM☆1,619Updated last year
- pytorch distribute tutorials☆122Updated last month
- The pure and clear PyTorch Distributed Training Framework.☆276Updated last year
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,023Updated 6 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,658Updated 8 months ago
- Cool Papers - Immersive Paper Discovery☆517Updated 2 weeks ago