CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.
☆40Nov 19, 2023Updated 2 years ago
Alternatives and similar repositories for CuPBoP-AMD
Users that are interested in CuPBoP-AMD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆144Jan 3, 2025Updated last year
- ☆17Oct 9, 2023Updated 2 years ago
- Cluster simulator with far memory☆12Apr 28, 2020Updated 5 years ago
- Environment control for benchmarks☆14Feb 10, 2025Updated last year
- A prototype tool to measure the bandwidth of a ROS 2 topic with minimal CPU overhead☆19Oct 15, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Computational Memory Neural Network Compiler☆11Aug 11, 2021Updated 4 years ago
- TLUT tool flow for parameterised configurations for FPGAs☆16Aug 5, 2024Updated last year
- ☆10Mar 14, 2018Updated 8 years ago
- Generate Zynq configurations without using the vendor GUI☆30Jul 5, 2023Updated 2 years ago
- This GitHub repo contains the artifact for CPElide, which appears at MICRO '24☆15Sep 7, 2024Updated last year
- ☆11Dec 31, 2019Updated 6 years ago
- Implement FlashAttention v2 with minimal code to learn.☆16Jun 12, 2024Updated last year
- HWASim is a simulator for heterogeneous systems with CPUs and Hardware Accelerators (HWAs). It is released with the DASH memory scheduler…☆19Jan 11, 2016Updated 10 years ago
- ☆13Nov 25, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Verification environment for the OpenHW Group's CORE-V High Performance Data Cache controller.☆24Jan 6, 2026Updated 3 months ago
- 32-bit RISC-V based processor with memory controler☆16Sep 2, 2022Updated 3 years ago
- A Hardware Implemented Poseidon Hasher☆20Apr 15, 2022Updated 4 years ago
- Linux环境下的文档在线预览,支持公式☆11Jul 3, 2018Updated 7 years ago
- Linpack: configuration, install, optimization☆16Jul 3, 2019Updated 6 years ago
- A Google images scraper to collect a labeled face dataset.☆11Oct 24, 2018Updated 7 years ago
- ☆13Oct 11, 2025Updated 6 months ago
- Unstructured computations on emerging architectures.☆14Jun 1, 2022Updated 3 years ago
- Low-level API examples using C++, C#, Rust, Python, Swift, Java, and Kotlin languages☆22Nov 11, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- UPP is a minimalist and generic text preprocessor using Lua macros.☆13Oct 13, 2024Updated last year
- Generic plugin architecture for message transport☆21Feb 7, 2013Updated 13 years ago
- 关于深度学习算法、框架、编译器、加速器的一些理解☆16Jul 2, 2022Updated 3 years ago
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 6 months ago
- ☆15Feb 18, 2025Updated last year
- seeta face detection for Android☆11Sep 23, 2017Updated 8 years ago
- WorldPalette is a Maya plugin based on the 2015 SIGGRAPH paper, WorldBrush: Interactive Example-based Synthesis of Procedural Virtual Wor…☆11May 10, 2021Updated 4 years ago
- This repository contains the 3D face reconstruction results from a single image.☆16Jun 14, 2018Updated 7 years ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!☆117Jul 22, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MLIR backend for optimising graph algorithms☆17Mar 30, 2024Updated 2 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆45Oct 25, 2021Updated 4 years ago
- HIP Python Low-level Bindings☆35Feb 12, 2026Updated 2 months ago
- Voxelizing the scene by filling information into structured buffers of 8 different cascades and performing cone tracing to traverse throu…☆13Jul 6, 2017Updated 8 years ago
- ☆11Nov 3, 2022Updated 3 years ago
- Voxel rendering engine for master thesis (voxel skeletal animation)☆13Oct 20, 2024Updated last year
- ECE408 (Applied Parallel Programming) Fall 2022 MP☆20Mar 24, 2023Updated 3 years ago