groq / mlagilityLinks
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
☆40Updated 3 months ago
Alternatives and similar repositories for mlagility
Users that are interested in mlagility are comparing it to the libraries listed below
Sorting:
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- python package of rocm-smi-lib☆24Updated 4 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆15Updated 11 months ago
- ☆71Updated 7 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆114Updated 3 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆61Updated last week
- A Data-Centric Compiler for Machine Learning☆85Updated last year
- ☆120Updated last year
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated 9 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 3 months ago
- MLPerf™ logging library☆37Updated last month
- PB-LLM: Partially Binarized Large Language Models☆156Updated 2 years ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 4 months ago
- LLM-Inference-Bench☆57Updated 4 months ago
- TORCH_LOGS parser for PT2☆64Updated last week
- ☆107Updated this week
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆41Updated last year
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆277Updated 2 years ago
- Benchmarks to capture important workloads.☆31Updated 9 months ago
- Benchmark suite for LLMs from Fireworks.ai☆83Updated this week
- ☆21Updated 8 months ago
- ☆28Updated 10 months ago
- High-Performance SGEMM on CUDA devices☆110Updated 9 months ago
- Prototype routines for GPU quantization written using PyTorch.☆21Updated 3 months ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆216Updated last week
- An innovative library for efficient LLM inference via low-bit quantization☆349Updated last year
- ☆218Updated 9 months ago
- How to ensure correctness and ship LLM generated kernels in PyTorch☆117Updated last week
- Training material for IPU users: tutorials, feature examples, simple applications☆87Updated 2 years ago
- Example ML projects that use the Determined library.☆32Updated last year