MUSA Templates for Linear Algebra Subroutines
☆45Jan 30, 2026Updated 2 months ago
Alternatives and similar repositories for mutlass
Users that are interested in mutlass are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 2 months ago
- ☆46Jan 13, 2026Updated 3 months ago
- go-onedrive is a Go client library for accessing the Microsoft OneDrive API.☆10Dec 12, 2018Updated 7 years ago
- ☆10Sep 23, 2023Updated 2 years ago
- yolov5_obb C++ onnxruntime deployment☆10Mar 11, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 🦙🦙.🦀☆28Sep 24, 2023Updated 2 years ago
- This repository was created to maintain "calltree", initial source code from calltree-2.3. "calltree" is a static call tree generator for…☆25Sep 14, 2019Updated 6 years ago
- torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics c…☆488Mar 17, 2026Updated 3 weeks ago
- ☆12Nov 19, 2020Updated 5 years ago
- ☆11Feb 13, 2025Updated last year
- Parallel implementation of bzip2 using cuda☆32Apr 21, 2011Updated 14 years ago
- a static analytical model for LLM distributed training☆131Jan 8, 2026Updated 3 months ago
- The Free Software Media System. 适用于Rockchip SoC 和 RTD1296 的 Jellyfin,请使用已编译的镜像 https://hub.docker.com/u/jjm2473☆16Jan 29, 2024Updated 2 years ago
- ☆13Sep 16, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆27Apr 25, 2024Updated last year
- 【不再维护】本仓库最初目的是为了减轻组内学生书写负担,特别是对于郑大电信学院,轻大李老师重头建立了新一版模版。这太好了,我就不需要花精力维护了,感谢机器人实验室/刘老师李老师,谢谢大家支持!☆13Mar 20, 2024Updated 2 years ago
- Code used in a short tutorial on LLVM passes for the Software Reliablity Group (SRG) at Imperial☆19Apr 26, 2015Updated 10 years ago
- Official repository for Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments☆16Jul 9, 2024Updated last year
- OpenCL Decoder work for libvpx' VP8 decoder.☆29Apr 13, 2017Updated 9 years ago
- Yet Another Aria2 Web Frontend in pure HTML/CSS/Javascirpt☆23Dec 30, 2015Updated 10 years ago
- ☆21May 19, 2023Updated 2 years ago
- study of cutlass☆22Nov 10, 2024Updated last year
- ☆14Apr 11, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Forward proxy plugin for the Caddy web server☆20Nov 29, 2022Updated 3 years ago
- Corrected source for the OpenCL in Action book (work in progress)☆63Aug 26, 2013Updated 12 years ago
- 3D Tools for the Windows Presentation Foundation (WPF)☆12Dec 27, 2024Updated last year
- 9p for Windows, with TCP and Hyper-V Socket transport support☆27Feb 16, 2026Updated 2 months ago
- ☆50Jun 24, 2025Updated 9 months ago
- Guide I wrote mostly for myself on how to run mlc-llm on the Orange Pi 5 Pro☆23Aug 15, 2025Updated 8 months ago
- OpenCL memory benchmark☆15Dec 21, 2016Updated 9 years ago
- Examine and discover LoongArch instructions☆23Jul 11, 2025Updated 9 months ago
- LLVM IR syntax highlighting specification for Notepad++☆13Dec 27, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An implementation of http://www.cs.huji.ac.il/~danix/ShadowRemoval/☆14May 11, 2017Updated 8 years ago
- The occupancy grid mapping, based on the Bayesian Filter, for radar detections.☆14Jul 30, 2019Updated 6 years ago
- An implementation of memcpy for amd64 with clang/gcc☆14Feb 7, 2022Updated 4 years ago
- OpenCL compilation with clang compiler.☆27Mar 12, 2025Updated last year
- Set of OpenCL microbenchmarks☆29Nov 19, 2025Updated 4 months ago
- Parallel GMRES (Generalized Minimal Residual) linear solver on GPU platforms☆27Oct 5, 2015Updated 10 years ago
- RISC-V emulator in Zig☆15Nov 4, 2023Updated 2 years ago