aime-team / pytorch-benchmarks
A benchmark framework for PyTorch
☆31 Updated 6 months ago
Alternatives and similar repositories for pytorch-benchmarks
Users who are interested in pytorch-benchmarks compare it to the libraries listed below.
- Example ML projects that use the Determined library. ☆32 Updated last year
- MLPerf™ logging library ☆37 Updated last week
- A parallel framework for training deep neural networks ☆63 Updated 6 months ago
- A collection of reproducible inference engine benchmarks ☆33 Updated 5 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training ☆13 Updated 9 months ago
- This repository contains the results and code for the MLPerf™ Training v2.1 benchmark. ☆15 Updated 2 years ago
- Train, tune, and infer Bamba model ☆132 Updated 3 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models. ☆11 Updated this week
- ☆74 Updated 5 months ago
- Get started with your NVIDIA Arm HPC Developers Kit! ☆34 Updated 2 years ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆89 Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries. ☆63 Updated 5 months ago
- LLM training in simple, raw C/CUDA ☆104 Updated last year
- Implementation of a methodology that allows all sorts of user defined GPU kernel fusion, for non CUDA programmers. ☆25 Updated this week
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions ☆33 Updated last week
- Benchmark suite for LLMs from Fireworks.ai ☆83 Updated 2 weeks ago
- Gpu benchmark ☆68 Updated 7 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po… ☆92 Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v4.0 benchmark. ☆12 Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆132 Updated 2 weeks ago
- Make triton easier ☆47 Updated last year
- python package of rocm-smi-lib ☆23 Updated 2 months ago
- Lightning Training strategy for HiveMind ☆18 Updated 2 weeks ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools ☆39 Updated last month
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption ☆107 Updated 2 years ago
- Torch Distributed Experimental ☆117 Updated last year
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components. ☆212 Updated this week
- AMD HPC Research Fund Cloud ☆15 Updated last month
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning ☆48 Updated 7 months ago
- Example of applying CUDA graphs to LLaMA-v2 ☆12 Updated 2 years ago