mli / transformers-benchmarksLinks
real Transformer TeraFLOPS on various GPUs
☆915Updated last year
Alternatives and similar repositories for transformers-benchmarks
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
Sorting:
- pytorch distribute tutorials☆151Updated 2 months ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,086Updated 8 months ago
- Rotary Transformer☆1,020Updated 3 years ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,509Updated 4 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆408Updated last month
- A fast MoE impl for PyTorch☆1,782Updated 7 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆157Updated 10 months ago
- Best practice for training LLaMA models in Megatron-LM☆661Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆2,159Updated 3 weeks ago
- ☆616Updated last year
- How to use wandb?☆675Updated 2 years ago
- 整理 pytorch 单机多 GPU 训练方法与原理☆848Updated 3 years ago
- PyTorch Project Specification.☆681Updated 4 years ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,327Updated last week
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆426Updated this week
- Efficient Training (including pre-training and fine-tuning) for Big Models☆606Updated 2 weeks ago
- A pupil in the computer world.(Felix Fu)☆243Updated last year
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,077Updated last year
- huggingface mirror download☆586Updated 5 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,666Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,415Updated last year
- 更纯粹、更高压缩率的Tokenizer☆482Updated 9 months ago
- Cool Papers - Immersive Paper Discovery☆611Updated 2 weeks ago
- Train a 1B LLM with 1T tokens from scratch by personal☆727Updated 4 months ago
- Inference code for LLaMA models☆123Updated 2 years ago
- personal chatgpt☆384Updated 8 months ago
- The repo for Tsinghua summer course: Interdisciplinary Seminar on Big Models☆373Updated 3 years ago
- An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much …☆1,509Updated 2 years ago
- 看图学大模型☆317Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆416Updated last year