huggingface / optimum-amdLinks
AMD related optimizations for transformer models
☆97Updated 3 months ago
Alternatives and similar repositories for optimum-amd
Users that are interested in optimum-amd are comparing it to the libraries listed below
Sorting:
- An innovative library for efficient LLM inference via low-bit quantization☆352Updated last year
- No-code CLI designed for accelerating ONNX workflows☆227Updated 7 months ago
- Fast and memory-efficient exact attention☆214Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆267Updated 2 months ago
- ☆219Updated last year
- ☆172Updated last week
- A safetensors extension to efficiently store sparse quantized tensors on disk☆238Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆64Updated 7 months ago
- ☆120Updated last year
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆327Updated 4 months ago
- ☆163Updated 7 months ago
- Explore training for quantized models☆26Updated 6 months ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆205Updated last week
- 8-bit CUDA functions for PyTorch☆70Updated 4 months ago
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily.☆184Updated 10 months ago
- ☆118Updated last month
- ☆79Updated last year
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆532Updated this week
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆74Updated last year
- Google TPU optimizations for transformers models☆134Updated 2 weeks ago
- ☆206Updated 9 months ago
- python package of rocm-smi-lib☆24Updated last month
- llama.cpp to PyTorch Converter☆37Updated last year
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆92Updated this week
- GPTQ inference Triton kernel☆321Updated 2 years ago
- Repository of model demos using TT-Buda☆63Updated 10 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 10 months ago
- 👷 Build compute kernels☆215Updated 2 weeks ago
- Development repository for the Triton language and compiler☆140Updated last week