mli / transformers-benchmarks
real Transformer TeraFLOPS on various GPUs
☆894Updated last year
Alternatives and similar repositories for transformers-benchmarks:
Users that are interested in transformers-benchmarks are comparing it to the libraries listed below
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,396Updated 3 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆398Updated 3 weeks ago
- ☆598Updated 8 months ago
- How to use wandb?☆608Updated last year
- pytorch distribute tutorials☆103Updated this week
- Rotary Transformer☆886Updated 2 years ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆151Updated 4 months ago
- 整理 pytorch 单机多 GPU 训练方法与原理☆797Updated 3 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,053Updated last month
- A quickstart and benchmark for pytorch distributed training.☆1,651Updated 6 months ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆573Updated 6 months ago
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆1,200Updated last year
- Best practice for training LLaMA models in Megatron-LM☆642Updated last year
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆276Updated last year
- 更纯粹、更高压缩率的Tokenizer☆471Updated 2 months ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,057Updated 6 months ago
- personal chatgpt☆337Updated last month
- Train a 1B LLM with 1T tokens from scratch by personal☆522Updated this week
- Pytorch❤️ Keras 😋😋☆1,869Updated 3 months ago
- ☆249Updated 9 months ago
- Cool Papers - Immersive Paper Discovery☆463Updated this week
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆296Updated 6 months ago
- 用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.☆2,662Updated 8 months ago
- Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback☆1,412Updated 8 months ago
- 该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题☆1,694Updated last month
- A fast MoE impl for PyTorch☆1,621Updated this week
- huggingface mirror download☆560Updated 3 months ago
- An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much…☆1,395Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,980Updated last week