mli / transformers-benchmarks
real Transformer TeraFLOPS on various GPUs
☆917Updated 2 years ago
Alternatives and similar repositories for transformers-benchmarks
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,112Updated last year
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,525Updated 4 years ago
- Rotary Transformer☆1,077Updated 3 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆405Updated 6 months ago
- A summary of methods and principles for single-machine multi-GPU training in PyTorch☆862Updated 4 years ago
- A fast MoE impl for PyTorch☆1,831Updated 11 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,666Updated last year
- PyTorch distributed tutorials☆170Updated 7 months ago
- How to use wandb?☆692Updated 2 years ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆160Updated last year
- ☆625Updated last month
- Efficient Training (including pre-training and fine-tuning) for Big Models☆618Updated 3 months ago
- Best practice for training LLaMA models in Megatron-LM☆664Updated 2 years ago
- PyTorch Project Specification.☆679Updated 4 years ago
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆485Updated this week
- An easy/swift-to-adapt PyTorch-Lightning template. A wrapper template that is simple to use: with minor changes to your existing PyTorch code, it adapts to Lightning. You can translate your previous Pytorch code much…☆1,539Updated 2 years ago
- huggingface mirror download☆590Updated 10 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆2,224Updated 5 months ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,092Updated last year
- A purer tokenizer with a higher compression ratio☆490Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆418Updated last year
- Cool Papers - Immersive Paper Discovery☆701Updated 5 months ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,524Updated last month
- Efficient Inference for Big Models☆585Updated 3 years ago
- Understanding the Transformer from its underlying mechanisms☆27Updated 3 years ago
- Inference code for LLaMA models☆128Updated 2 years ago
- RoFormer V1 & V2 pytorch☆519Updated 3 years ago
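- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the techniques, principles, and applications related to large models.☆371Updated last year
- The record of what I've been through. Now moved to Notion. See link below☆103Updated last year
- Learning large models through diagrams☆316Updated last year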