AMD-related optimizations for transformer models
☆100 · Apr 3, 2026 · Updated last week
Alternatives and similar repositories for optimum-amd
Users that are interested in optimum-amd are comparing it to the libraries listed below.
- Github action to connect to tailscale ☆20 · Mar 10, 2026 · Updated last month
- Chunk Dedupe Estimation ☆20 · Nov 5, 2024 · Updated last year
- Super fast FP32 matrix multiplication on RDNA3 ☆87 · Mar 30, 2025 · Updated last year
- Google TPU optimizations for transformers models ☆136 · Jan 23, 2026 · Updated 2 months ago
- Wave: Python Domain-Specific Language for High Performance Machine Learning ☆53 · Updated this week
- A course based on FINN with hands on Lectures, Examples and Labs to go from 0 to a full custom Quantized Neural Network running on your v… ☆44 · Jul 3, 2025 · Updated 9 months ago
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take… ☆96 · Apr 8, 2026 · Updated last week
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices ☆12 · Jul 1, 2021 · Updated 4 years ago
- Mirror for Java and PHP libraries and text resources to facilitate the use of Inuktitut in its written form on computers and the web ☆10 · Aug 2, 2015 · Updated 10 years ago
- Repository for work on Xilinx's matrix vector activation unit's RTL implementation. Documentation available at: https://asadalam.githu… ☆20 · Jan 21, 2022 · Updated 4 years ago
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O… ☆334 · Apr 3, 2026 · Updated last week
- A pytorch quantization backend for optimum ☆1,036 · Apr 2, 2026 · Updated 2 weeks ago
- The backend behind the LLM-Perf Leaderboard ☆11 · May 5, 2024 · Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆23 · Jul 30, 2024 · Updated last year
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models. ☆39 · Dec 2, 2025 · Updated 4 months ago
- Large Language Model Text Generation Inference on Habana Gaudi ☆34 · Mar 20, 2025 · Updated last year
- ☆22 · Apr 7, 2026 · Updated last week
- User-mode trap-and-emulate hypervisor for RISC-V ☆14 · Feb 11, 2022 · Updated 4 years ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU) ☆209 · Apr 3, 2026 · Updated last week
- Xen hypercall and interfaces in Rust ☆16 · Jan 14, 2025 · Updated last year
- An open source branch of AIE API ☆14 · Apr 30, 2025 · Updated 11 months ago
- Ongoing research training transformer models at scale ☆39 · Apr 9, 2026 · Updated last week
- ☆33 · Feb 3, 2025 · Updated last year
- ☆11 · Aug 10, 2016 · Updated 9 years ago
- Document Automation Reference Kit ☆16 · Jun 27, 2024 · Updated last year
- A Windows window manager investigation toolbox ☆14 · Mar 17, 2022 · Updated 4 years ago
- InnerEye dataset creation tool for InnerEye-DeepLearning library. Transforms DICOM data into mask for training Deep Learning models. ☆21 · Mar 21, 2024 · Updated 2 years ago
- minimalistic AI library that resembles HF's transformers ☆13 · Dec 31, 2024 · Updated last year
- Automatically derive Python dunder methods for your Rust code ☆25 · Apr 7, 2026 · Updated last week
- A huge dataset for Document Visual Question Answering ☆21 · Jul 29, 2024 · Updated last year
- ☆20 · Aug 1, 2024 · Updated last year
- Fast and memory-efficient exact attention ☆228 · Apr 9, 2026 · Updated last week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆3,354 · Apr 2, 2026 · Updated 2 weeks ago
- A lightweight triton-based General Matrix Multiplication (GEMM) library. ☆57 · Apr 8, 2026 · Updated last week
- A repository to compose tweets together for @StarshipPrompt ☆28 · Apr 12, 2020 · Updated 6 years ago
- A collection of instruction data and scripts for machine translation. ☆20 · Sep 23, 2023 · Updated 2 years ago
- Tools for manipulating Qualcomm XBL images ☆26 · Jan 18, 2024 · Updated 2 years ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆76 · Dec 25, 2024 · Updated last year
- Development repository for the Triton language and compiler ☆144 · Updated this week