Repository to host ROCm Developer Hub Notebook Tutorials
☆78May 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for gpuaidev
Users that are interested in gpuaidev are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆24Apr 7, 2026Updated last month
- Official repository Flash Local Linear Attention☆23Apr 23, 2026Updated 3 weeks ago
- A comprehensive framework to test audio comprehension of Large Audio Language Models.☆63Apr 29, 2026Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆30May 4, 2026Updated 2 weeks ago
- ☆15Feb 2, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Updated this week
- CSC Training: High-Level GPU Programming☆14Oct 16, 2025Updated 7 months ago
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.☆26Feb 20, 2025Updated last year
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d…☆69Dec 18, 2025Updated 5 months ago
- Solution to Problems in Quantum Field Theory by Franz Mandl & Graham Shaw☆13Oct 4, 2017Updated 8 years ago
- A Linux kernel module, that allows changing/toggling system parameters stored in MSR and PCI registers of x86 processors☆16Mar 29, 2023Updated 3 years ago
- ☆13Mar 16, 2018Updated 8 years ago
- Talleres de la Segunda Escuela de Computación Cuántica, Pontificia Universidad Católica de Chile, Santiago de Chile, 2024☆18Mar 3, 2024Updated 2 years ago
- Python library for hyperspectral analysis focused on spectroscopic approach.☆18Sep 9, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Step by step implementation of a fast softmax kernel in CUDA☆67Jan 6, 2025Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 3 months ago
- Accelerated computing with HIP☆29May 14, 2026Updated last week
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆28Oct 26, 2023Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆13May 4, 2026Updated 2 weeks ago
- Lime sample projects☆14Jan 2, 2025Updated last year
- A BUDE virtual-screening benchmark, in many programming models☆31Oct 15, 2024Updated last year
- ☆27Mar 23, 2026Updated last month
- ☆29Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆18Mar 12, 2025Updated last year
- ☆96Nov 11, 2025Updated 6 months ago
- ROCm Documentation Python package for ReadTheDocs build standardization☆17Updated this week
- Super fast FP32 matrix multiplication on RDNA3☆90Mar 30, 2025Updated last year
- The goal of the OSSCI Fleet is to provide a central mechanism to enable test automation, batch job scheduling, and developer access to a …☆13Apr 28, 2026Updated 3 weeks ago
- ☆20Apr 24, 2026Updated 3 weeks ago
- A Phaser 3 Project Template that uses a custom build of Phaser☆11Jan 1, 2023Updated 3 years ago
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆24May 8, 2026Updated last week
- A repository of codelabs and tutorials to support education in scientific computing☆28Dec 16, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- ☆76May 14, 2026Updated last week
- subtitles for CppCon2015 2016 .....☆15Nov 2, 2015Updated 10 years ago
- Proxy and universal translation server to Argo API, OpenAI (Chat+Responses), Anthropic (Messages), Google (GenAI) format compatible☆21Apr 26, 2026Updated 3 weeks ago
- LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs☆29May 31, 2025Updated 11 months ago
- An interactive web-based tool for exploring intermediate representations of PyTorch and Triton models☆49Jan 23, 2026Updated 3 months ago
- Personal solutions to the Triton Puzzles☆21Jul 18, 2024Updated last year