a computing kernel implementation in ML inference framework aiming at theoretical limit
☆12Dec 18, 2019Updated 6 years ago
Alternatives and similar repositories for speedup-aarch64-cpu
Users that are interested in speedup-aarch64-cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A toy compiler for subset of c++ written in python☆16Jan 17, 2025Updated last year
- Kernel code for Samsung Galaxy S21 (Snapdragon 888)☆19Jul 4, 2021Updated 4 years ago
- Run OpenCL program on MOBILE GPU (Qualcomm & ARM) !☆18Jun 27, 2018Updated 7 years ago
- 阴阳师御魂方案计算工具,基于动态规划和剪枝☆14Sep 3, 2018Updated 7 years ago
- A demo project for a computation graph implementation in C++.☆11Jul 2, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple task scheduler☆11May 16, 2023Updated 2 years ago
- C++ deepsort on tensorflow☆18Apr 4, 2020Updated 6 years ago
- C/C++ Dynamic Memory Analyzer (CMA)☆18Jul 29, 2014Updated 11 years ago
- A set of tools to work with cgroup tree and process classification/QoS according to it☆10Oct 1, 2019Updated 6 years ago
- Old academic project for my PhD - no longer maintained by me: fast gaussian and derivative convolutional filters☆10May 1, 2019Updated 6 years ago
- 音视频分析工具☆12May 10, 2017Updated 8 years ago
- ☆16Aug 11, 2016Updated 9 years ago
- A Highlevel Python Wrapper for Vulkan's Compute API☆18Jan 14, 2026Updated 3 months ago
- A structure from motion implemention in C++ and accelerated using CUDA☆48Oct 12, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Clone of https://code.google.com/p/google-coredumper/ with enhancements by Amadeus☆13Jul 2, 2024Updated last year
- Reed-Solomon Erasure Coding in Haskell☆23Jan 22, 2017Updated 9 years ago
- CUDA Template Functions☆20Dec 16, 2025Updated 4 months ago
- linux内核异步内存回收的另一个思路:基于冷热文件的冷热区域精准的回收冷文件页page(可做成内核ko)☆12Jun 14, 2024Updated last year
- ☆20Oct 3, 2023Updated 2 years ago
- The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Inte…☆17Mar 28, 2019Updated 7 years ago
- [IJCNN'19, IEEE JSTSP'19] Caffe code for our paper "Structured Pruning for Efficient ConvNets via Incremental Regularization"; [BMVC'18] …☆14Feb 14, 2020Updated 6 years ago
- 用ATSS训练自己的目标检测模型!! 超详细教程和PDF教程下载!!!☆10Jul 28, 2020Updated 5 years ago
- OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library☆27Dec 9, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- a simple epoll wrapper☆21Apr 8, 2011Updated 15 years ago
- Pytorch implementation of DAC-Net ("Zhongying Deng, Kaiyang Zhou, Yongxin Yang, Tao Xiang. Domain Attention Consistency for Multi-Source …☆24Dec 13, 2021Updated 4 years ago
- Documentation for the entire CGRAFlow☆19Sep 17, 2021Updated 4 years ago
- disk prediction papers☆19Oct 24, 2020Updated 5 years ago
- LLVM passes with usage instructions☆18Apr 23, 2017Updated 8 years ago
- ☆13Jun 9, 2023Updated 2 years ago
- dnotify,inotify, and fanotify example code from http://www.lanedo.com/filesystem-monitoring-linux-kernel/☆14Apr 28, 2017Updated 8 years ago
- OSDT2019相关资料☆16Nov 17, 2019Updated 6 years ago
- This is a caffe implementation of ShuffleNet model☆15Mar 15, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A prototype implementation of AllReduce collective communication routine.☆19Sep 27, 2018Updated 7 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- strace-perfetto runs strace and converts the raw output to a Trace Event JSON file. The JSON file can then be analyzed using Google's Per…☆12Apr 27, 2022Updated 3 years ago
- convert pytorch trained yolo model to ncnn for Flexible deployment☆10Aug 30, 2018Updated 7 years ago
- Project for testing remote debugging of C++ code with gdb and gdbserver in VS Code☆20Jun 6, 2018Updated 7 years ago
- This is a pintool that can analyze target dynamically and output code blocks and "key frames".☆14Mar 26, 2015Updated 11 years ago