☆14Nov 9, 2024Updated last year
Alternatives and similar repositories for cuasmrl
Users that are interested in cuasmrl are comparing it to the libraries listed below
Sorting:
- ☆17Jan 24, 2024Updated 2 years ago
- OSDI 2023 Welder, deeplearning compiler☆32Nov 24, 2023Updated 2 years ago
- Programmable JIT Compilation and Optimization for C/C++ using LLVM☆45Updated this week
- ☆10Sep 28, 2020Updated 5 years ago
- A design of 15-order FIR filter using Verilog, with modulation and demodulation system using MATLAB☆10Aug 15, 2020Updated 5 years ago
- ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch☆39Mar 27, 2025Updated 11 months ago
- ☆13May 8, 2025Updated 9 months ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- Automated bottleneck detection and solution orchestration☆19Updated this week
- ☆12May 18, 2024Updated last year
- ☆11Jun 29, 2021Updated 4 years ago
- ☆11Jul 2, 2025Updated 8 months ago
- 基于 HarmonyOS 的简易分布式 Todolist App,浙江大学短学期课程作业☆10Jul 15, 2021Updated 4 years ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆14Feb 14, 2020Updated 6 years ago
- ☆12Jan 7, 2025Updated last year
- UCAS网络登录☆13Nov 17, 2018Updated 7 years ago
- PyDTNN - Python Distributed Training of Neural Networks☆14Feb 20, 2026Updated last week
- A fast alternative to the standard C/C++ pow() function. With adjustable accuracy-space tradeoff.☆14Jul 12, 2013Updated 12 years ago
- This repository is the accompanying code for the paper CFVFP. This paper presents a new algorithm for solving incomplete information game…☆14Feb 23, 2025Updated last year
- This is an efficient cuda implementation of 2D depthwise convolution for large kernel, it can be used in Pytorch deep learning framework.☆11Sep 28, 2023Updated 2 years ago
- ☆13Sep 19, 2024Updated last year
- 中国科学技术大学龙芯杯参赛作品仓库合集☆16Oct 2, 2024Updated last year
- Layout, rendering ELK Graph generated by easysoc-firrtl, and display the graph as an interactive diagram to represent Chisel generated Fi…☆12Apr 1, 2022Updated 3 years ago
- Mamba-Spike——CGI2024☆13Dec 3, 2025Updated 2 months ago
- A JIT compiler for the BBC micro:bit☆11Apr 29, 2018Updated 7 years ago
- 召唤之巅:七圣召唤赛事资料站☆15Updated this week
- IPXACT packaging utilities for Chisel 3.x using Xilinx Vivado Design Suite.☆12Dec 5, 2018Updated 7 years ago
- ☆13Sep 11, 2020Updated 5 years ago
- ☆13Mar 6, 2023Updated 2 years ago
- iOS: decode audio data by audio converter (aac(...other compression format)->pcm)☆12May 19, 2019Updated 6 years ago
- ☆13Nov 1, 2021Updated 4 years ago
- ☆18Oct 29, 2025Updated 4 months ago
- BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago
- CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark☆34Jun 24, 2025Updated 8 months ago
- ☆24Jun 12, 2023Updated 2 years ago
- ☆17Dec 8, 2023Updated 2 years ago
- Code for Federated Neuromorphic Learning of Spiking Neural Networks for Low-Power Edge Intelligence☆17Dec 9, 2020Updated 5 years ago
- Distributed machine learning platform☆13Aug 20, 2015Updated 10 years ago
- [NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning☆13Oct 19, 2022Updated 3 years ago