mli / transformers-benchmarksLinks
real Transformer TeraFLOPS on various GPUs
☆913Updated last year
Alternatives and similar repositories for transformers-benchmarks
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
Sorting:
- pytorch distribute tutorials☆146Updated 2 months ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,498Updated 4 years ago
- Rotary Transformer☆1,009Updated 3 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,086Updated 7 months ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆408Updated 3 weeks ago
- A fast MoE impl for PyTorch☆1,777Updated 6 months ago
- How to use wandb?☆674Updated last year
- 整理 pytorch 单机多 GPU 训练方法与原理☆846Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆2,142Updated last week
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆155Updated 10 months ago
- ☆614Updated last year
- PyTorch Project Specification.☆681Updated 4 years ago
- Best practice for training LLaMA models in Megatron-LM☆660Updated last year
- A quickstart and benchmark for pytorch distributed training.☆1,668Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆410Updated this week
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,298Updated last week
- huggingface mirror download☆585Updated 4 months ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆604Updated 2 months ago
- A brief of TorchScript by MNIST☆112Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,411Updated last year
- Cool Papers - Immersive Paper Discovery☆596Updated 2 months ago
- An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much…☆1,498Updated 2 years ago
- 更纯粹、更高压缩率的Tokenizer☆481Updated 8 months ago
- personal chatgpt☆381Updated 8 months ago
- The record of what I‘ve been through.☆100Updated 7 months ago
- MindSpore online courses: Step into LLM☆477Updated this week
- Inference code for LLaMA models☆122Updated 2 years ago
- Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Tran…☆275Updated last year
- A pupil in the computer world.(Felix Fu)☆243Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆335Updated last year