aime-team / pytorch-benchmarksLinks
A benchmark framework for Pytorch
☆32Updated 10 months ago
Alternatives and similar repositories for pytorch-benchmarks
Users that are interested in pytorch-benchmarks are comparing it to the libraries listed below
Sorting:
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 6 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Updated last year
- MLPerf™ logging library☆38Updated last month
- Parallel framework for training and fine-tuning deep neural networks☆70Updated 3 months ago
- Example ML projects that use the Determined library.☆32Updated last year
- Quantize transformers to any learned arbitrary 4-bit numeric format☆50Updated 2 weeks ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Updated 3 years ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated last year
- Adaptive Parallel PDF Parsing and Resource Scaling Engine☆62Updated last month
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Updated last week
- Train, tune, and infer Bamba model☆137Updated 8 months ago
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆64Updated last month
- ☆64Updated 8 months ago
- A collection of reproducible inference engine benchmarks☆38Updated 9 months ago
- A calculator to estimate the memory footprint, capacity, and latency on VMware Private AI with NVIDIA.☆38Updated 6 months ago
- Pipeline parallelism for the minimalist☆40Updated 6 months ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆53Updated 11 months ago
- Gpu benchmark☆74Updated last year
- ☆71Updated 10 months ago
- Compressing Large Language Models using Low Precision and Low Rank Decomposition☆106Updated 2 months ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆45Updated 3 years ago
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- Benchmarks to capture important workloads.☆32Updated this week
- Unit Scaling demo and experimentation code☆16Updated last year
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆170Updated last month
- ☆16Updated 2 months ago
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- ☆71Updated last year
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆70Updated 9 months ago