mli / transformers-benchmarks
real Transformer TeraFLOPS on various GPUs
☆892Updated last year
Alternatives and similar repositories for transformers-benchmarks:
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,044Updated 3 weeks ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,389Updated 3 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆397Updated 2 months ago
- Rotary Transformer☆858Updated 2 years ago
- ☆598Updated 7 months ago
- pytorch distribute tutorials☆97Updated 3 months ago
- How to use wandb?☆604Updated last year
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆149Updated 3 months ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆798Updated this week
- Efficient Training (including pre-training and fine-tuning) for Big Models☆574Updated 5 months ago
- 更纯粹、更高压缩率的Tokenizer☆468Updated last month
- Cool Papers - Immersive Paper Discovery☆447Updated last month
- Best practice for training LLaMA models in Megatron-LM☆638Updated last year
- 整理 pytorch 单机多 GPU 训练方法与原理☆791Updated 3 years ago
- An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much…☆1,385Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,957Updated 3 weeks ago
- A fast MoE impl for PyTorch☆1,596Updated 6 months ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,054Updated 5 months ago
- RoFormer V1 & V2 pytorch☆482Updated 2 years ago
- Train a 1B LLM with 1T tokens from scratch by personal☆469Updated this week
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,009Updated 3 months ago
- PyTorch Project Specification.☆672Updated 3 years ago
- personal chatgpt☆334Updated last month
- Efficient Inference for Big Models☆574Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆290Updated this week
- ☆247Updated 8 months ago
- huggingface mirror download☆557Updated 2 months ago
- 利用HuggingFace的官方下载工具从镜像网站进行高速下载。☆921Updated 3 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,650Updated 5 months ago