CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.
☆40Nov 19, 2023Updated 2 years ago
Alternatives and similar repositories for CuPBoP-AMD
Users that are interested in CuPBoP-AMD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆150Jan 3, 2025Updated last year
- Environment control for benchmarks☆14Feb 10, 2025Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 4 years ago
- Computational Memory Neural Network Compiler☆11Aug 11, 2021Updated 4 years ago
- Sample programs for the LLVM PTX back-end☆41Aug 27, 2015Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆55Nov 21, 2019Updated 6 years ago
- Generate Zynq configurations without using the vendor GUI☆30Jul 5, 2023Updated 2 years ago
- This GitHub repo contains the artifact for CPElide, which appears at MICRO '24☆16Sep 7, 2024Updated last year
- ☆10Aug 10, 2018Updated 7 years ago
- Cluster Far Mem, framework to execute single job and multi job experiments using fastswap☆21Jan 12, 2024Updated 2 years ago
- Implement FlashAttention v2 with minimal code to learn.☆16Jun 12, 2024Updated 2 years ago
- HWASim is a simulator for heterogeneous systems with CPUs and Hardware Accelerators (HWAs). It is released with the DASH memory scheduler…☆19Jan 11, 2016Updated 10 years ago
- ☆19Dec 7, 2020Updated 5 years ago
- Verification environment for the OpenHW Group's CORE-V High Performance Data Cache controller.☆26Jan 6, 2026Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An MLIR-based toy DL compiler for TVM Relay.☆62Oct 16, 2022Updated 3 years ago
- Tensorflow 2 code for several U-Net variants to perform direct comparisons including base, attention, dense, ++, squeeze-excite, inceptio…☆15Mar 22, 2022Updated 4 years ago
- A Hardware Implemented Poseidon Hasher☆20Apr 15, 2022Updated 4 years ago
- A Google images scraper to collect a labeled face dataset.☆11Oct 24, 2018Updated 7 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆21Jul 13, 2025Updated 11 months ago
- ☆41Jan 23, 2024Updated 2 years ago
- UPP is a minimalist and generic text preprocessor using Lua macros.☆13Oct 13, 2024Updated last year
- seeta face detection for Android☆11Sep 23, 2017Updated 8 years ago
- DATE'24 paper: "Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs"☆21Dec 10, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Unstructured computations on emerging architectures.☆17Jun 1, 2022Updated 4 years ago
- A procedurally generated city rendered with D3D12 and Vulkan☆13Feb 4, 2020Updated 6 years ago
- WorldPalette is a Maya plugin based on the 2015 SIGGRAPH paper, WorldBrush: Interactive Example-based Synthesis of Procedural Virtual Wor…☆11May 10, 2021Updated 5 years ago
- An implementation of Lz77 compression algorithm on FPGA using MaxCompiler programming tool.☆10Sep 4, 2015Updated 10 years ago
- six-voice sampler and sequencer for monome norns☆48May 30, 2026Updated 2 weeks ago
- Plugin to display surfel clouds in the ROS visualizer RViz☆13Aug 8, 2019Updated 6 years ago
- A simple MIPS CPU for BUAA CO course (and now NSCSCC).☆10May 15, 2021Updated 5 years ago
- LPC (Local Procedure Call) is a portion of Windows NT kernel, used for fast communication between threads or processes. It can be also us…☆17Mar 21, 2021Updated 5 years ago
- Systemd Robot Initialization☆13Oct 31, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MLIR backend for optimising graph algorithms☆17Mar 30, 2024Updated 2 years ago
- Fluid simulation via an optimised implementation of the FLIP (FLuid Implicit Particle) algortihm.☆14May 20, 2022Updated 4 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆45Oct 25, 2021Updated 4 years ago
- HIP Python Low-level Bindings☆39Apr 15, 2026Updated 2 months ago
- ☆11Nov 3, 2022Updated 3 years ago
- ECE408 (Applied Parallel Programming) Fall 2022 MP☆21Mar 24, 2023Updated 3 years ago
- ✅ GitHub Check Runs Action☆15Jan 5, 2023Updated 3 years ago