mli / transformers-benchmarksLinks
real Transformer TeraFLOPS on various GPUs
☆916Updated last year
Alternatives and similar repositories for transformers-benchmarks
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
Sorting:
- Rotary Transformer☆1,039Updated 3 years ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,513Updated 4 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,092Updated 9 months ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆407Updated 2 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆159Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆439Updated last month
- 整理 pytorch 单机多 GPU 训练方法与原理☆849Updated 3 years ago
- pytorch distribute tutorials☆153Updated 4 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆2,176Updated 2 months ago
- A fast MoE impl for PyTorch☆1,806Updated 8 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,664Updated last year
- ☆617Updated last year
- 更纯粹、更高压缩率的Tokenizer☆485Updated 10 months ago
- PyTorch Project Specification.☆681Updated 4 years ago
- Best practice for training LLaMA models in Megatron-LM☆660Updated last year
- A brief of TorchScript by MNIST☆112Updated 3 years ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,395Updated this week
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,081Updated last year
- How to use wandb?☆680Updated 2 years ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆611Updated last month
- Train a 1B LLM with 1T tokens from scratch by personal☆740Updated 5 months ago
- Cool Papers - Immersive Paper Discovery☆630Updated last month
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,037Updated last year
- 用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.☆2,862Updated last year
- huggingface mirror download☆585Updated 6 months ago
- RoFormer V1 & V2 pytorch☆514Updated 3 years ago
- The record of what I‘ve been through.☆101Updated 9 months ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆344Updated last year
- A pupil in the computer world.(Felix Fu)☆244Updated last year
- The repo for Tsinghua summer course: Interdisciplinary Seminar on Big Models☆376Updated 3 years ago