MUSA Templates for Linear Algebra Subroutines
☆44Jan 30, 2026Updated last month
Alternatives and similar repositories for mutlass
Users that are interested in mutlass are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 2 months ago
- COCCL: Compression and precision co-aware collective communication library☆30Mar 16, 2025Updated last year
- ☆11Sep 23, 2023Updated 2 years ago
- Fast and Stable Color Balancing for Images and Augmented Reality☆16Dec 19, 2015Updated 10 years ago
- yolov5_obb C++ onnxruntime deployment☆10Mar 11, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆14Jul 9, 2023Updated 2 years ago
- 🦙🦙.🦀☆28Sep 24, 2023Updated 2 years ago
- torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics c…☆484Mar 17, 2026Updated last week
- a static analytical model for LLM distributed training☆126Jan 8, 2026Updated 2 months ago
- ☆11Feb 13, 2025Updated last year
- Parallel implementation of bzip2 using cuda☆32Apr 21, 2011Updated 14 years ago
- ☆27Apr 25, 2024Updated last year
- 【不再维护】本仓库最初目的是为了减轻组内学生书写负担,特别是对于郑大电信学院,轻大李老师重头建立了新一版模版。这太好了,我就不需要花精力维护了,感谢机器人实验室/刘老师李老师,谢谢大家支持!☆14Mar 20, 2024Updated 2 years ago
- Yet Another Aria2 Web Frontend in pure HTML/CSS/Javascirpt☆23Dec 30, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- study of cutlass☆22Nov 10, 2024Updated last year
- some great libraries such as libbase, porting from chromium opensource project, for android ndk project use.☆10May 9, 2020Updated 5 years ago
- GPU Microcontroller Compiler☆24Jul 14, 2013Updated 12 years ago
- Corrected source for the OpenCL in Action book (work in progress)☆63Aug 26, 2013Updated 12 years ago
- ☆49Jun 24, 2025Updated 9 months ago
- 3D Tools for the Windows Presentation Foundation (WPF)☆12Dec 27, 2024Updated last year
- Guide I wrote mostly for myself on how to run mlc-llm on the Orange Pi 5 Pro☆23Aug 15, 2025Updated 7 months ago
- A Docker utility to manager image and tag information from Docker Hub.☆12May 11, 2023Updated 2 years ago
- The implementation of https://github.com/lucasheld/uptime-kuma-api with fastapi☆11Dec 14, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- nlog appender for kafka which provides the custom topics pattern and partitions☆11Nov 10, 2025Updated 4 months ago
- OpenCL compilation with clang compiler.☆27Mar 12, 2025Updated last year
- Set of OpenCL microbenchmarks☆29Nov 19, 2025Updated 4 months ago
- Parallel GMRES (Generalized Minimal Residual) linear solver on GPU platforms☆27Oct 5, 2015Updated 10 years ago
- PorarSSL <-> OpenSSL compatibility layer☆19Jul 2, 2015Updated 10 years ago
- [Unsupported] NodeJS module that calculates a square area and downloads all the map tiles contained in that area☆10Nov 8, 2017Updated 8 years ago
- mirror all your visible repos between gitlab and github☆13Oct 29, 2024Updated last year
- vapoursynth playground, encode your first video here☆14Sep 27, 2025Updated 6 months ago
- ROCm Device Libraries☆96May 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 跨社区工单追踪 & 讨论场所 / Cross-community issue tracker & discussions☆22Oct 13, 2023Updated 2 years ago
- linear algebra package. like gonum/mat, but small. lets say gonum-lite☆12Jul 8, 2023Updated 2 years ago
- high performance .NET library for MQTT based communication support v3.x and v5.0 protocols☆16May 10, 2024Updated last year
- My PwSH prompt☆11Feb 27, 2025Updated last year
- A tool for compiling and linking Zig libraries to Rust projects.☆14Feb 2, 2023Updated 3 years ago
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- Work in progress rust bindings to ggml☆12May 1, 2023Updated 2 years ago