huggingface / optimum-amd
AMD-related optimizations for transformer models
☆97 Updated 3 months ago
Alternatives and similar repositories for optimum-amd
Users who are interested in optimum-amd are comparing it to the libraries listed below.
- An innovative library for efficient LLM inference via low-bit quantization ☆352 Updated last year
- No-code CLI designed for accelerating ONNX workflows ☆227 Updated 7 months ago
- Fast and memory-efficient exact attention ☆214 Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆93 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆267 Updated 2 months ago
- ☆219 Updated last year
- ☆172 Updated last week
- A safetensors extension to efficiently store sparse quantized tensors on disk ☆238 Updated last week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU (XPU) device. Note… ☆64 Updated 7 months ago