a computing kernel implementation in ML inference framework aiming at theoretical limit
☆12Dec 18, 2019Updated 6 years ago
Alternatives and similar repositories for speedup-aarch64-cpu
Users that are interested in speedup-aarch64-cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A toy compiler for subset of c++ written in python☆16Jan 17, 2025Updated last year
- Kernel code for Samsung Galaxy S21 (Snapdragon 888)☆20Jul 4, 2021Updated 4 years ago
- A test case for evaluating the performance of the workgroup reduction operation in OpenCL 2.0☆10Nov 26, 2020Updated 5 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- Simple task scheduler☆11May 16, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A general performance test framework for Distributed File System☆15Oct 8, 2020Updated 5 years ago
- C++ deepsort on tensorflow☆18Apr 4, 2020Updated 6 years ago
- C/C++ Dynamic Memory Analyzer (CMA)☆18Jul 29, 2014Updated 11 years ago
- This is the extension of Mask RCNN model to 3D images.☆12May 7, 2019Updated 7 years ago
- A set of tools to work with cgroup tree and process classification/QoS according to it☆10Oct 1, 2019Updated 6 years ago
- NVM user-space Primitives API library repository☆18Mar 12, 2014Updated 12 years ago
- Old academic project for my PhD - no longer maintained by me: fast gaussian and derivative convolutional filters☆10May 1, 2019Updated 7 years ago
- ☆16Aug 11, 2016Updated 9 years ago
- A structure from motion implemention in C++ and accelerated using CUDA☆48Oct 12, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 个人笔记☆16Apr 1, 2026Updated last month
- Clone of https://code.google.com/p/google-coredumper/ with enhancements by Amadeus☆13Jul 2, 2024Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- Reed-Solomon Erasure Coding in Haskell☆23Jan 22, 2017Updated 9 years ago
- CUDA Template Functions☆20Dec 16, 2025Updated 4 months ago
- AES-OpenCL - An AES implementation in OpenCL. This is the source code for my Bachelor's diploma. I did research on accelerating cryptogra…☆25Feb 7, 2013Updated 13 years ago
- ☆20Oct 3, 2023Updated 2 years ago
- The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Inte…☆17Mar 28, 2019Updated 7 years ago
- Last Writer Slicing: data provenance tracking for concurrent program debugging & analysis☆13Nov 14, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [IJCNN'19, IEEE JSTSP'19] Caffe code for our paper "Structured Pruning for Efficient ConvNets via Incremental Regularization"; [BMVC'18] …☆14Feb 14, 2020Updated 6 years ago
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- disk prediction papers☆19Oct 24, 2020Updated 5 years ago
- LLVM passes with usage instructions☆18Apr 23, 2017Updated 9 years ago
- ☆14Jun 9, 2023Updated 2 years ago
- dnotify,inotify, and fanotify example code from http://www.lanedo.com/filesystem-monitoring-linux-kernel/☆14Apr 28, 2017Updated 9 years ago
- OSDT2019相关资料☆16Nov 17, 2019Updated 6 years ago
- An AI/ML solution that provides a probability that a hard drive will fail within some pre-defined time period.☆13Dec 9, 2022Updated 3 years ago
- This is a caffe implementation of ShuffleNet model☆15Mar 15, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- C/C++ header dependency list generator. Output can be used to create a dependency graph.☆14Apr 13, 2021Updated 5 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- convert pytorch trained yolo model to ncnn for Flexible deployment☆10Aug 30, 2018Updated 7 years ago
- This is a pintool that can analyze target dynamically and output code blocks and "key frames".☆14Mar 26, 2015Updated 11 years ago
- Multi-Source Domain Adaptation via Optimal Transport for Student-Teacher Learning - UAI 2021☆22Jan 26, 2025Updated last year
- 3d mask rcnn☆19May 13, 2019Updated 6 years ago