A compute kernel implementation for an ML inference framework, aiming at the theoretical limit
☆12 Dec 18, 2019 Updated 6 years ago
Alternatives and similar repositories for speedup-aarch64-cpu
Users that are interested in speedup-aarch64-cpu are comparing it to the libraries listed below.
- Kernel code for Samsung Galaxy S21 (Snapdragon 888) ☆20 Jul 4, 2021 Updated 4 years ago
- ☆14 Feb 26, 2026 Updated 3 months ago
- A calculator for Onmyoji soul (御魂) builds, based on dynamic programming and pruning ☆14 Sep 3, 2018 Updated 7 years ago
- A demo project for a computation graph implementation in C++. ☆11 Jul 2, 2019 Updated 6 years ago
- A general performance test framework for Distributed File System ☆15 Oct 8, 2020 Updated 5 years ago
- C++ deepsort on tensorflow ☆18 Apr 4, 2020 Updated 6 years ago
- NVM user-space Primitives API library repository ☆18 Mar 12, 2014 Updated 12 years ago
- Old academic project for my PhD - no longer maintained by me: fast Gaussian and derivative convolutional filters ☆10 May 1, 2019 Updated 7 years ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading" [MobiCom'2022] ☆19 Aug 4, 2022 Updated 3 years ago
- C++ lock-free queue. ☆14 Jun 24, 2020 Updated 5 years ago
- A ConvGRU2D (equivalent to ConvLSTM2D) is added to Keras with a corresponding example ☆11 May 28, 2018 Updated 8 years ago
- AES-OpenCL - An AES implementation in OpenCL. This is the source code for my Bachelor's diploma. I did research on accelerating cryptogra… ☆25 Feb 7, 2013 Updated 13 years ago
- ☆20 Oct 3, 2023 Updated 2 years ago
- Last Writer Slicing: data provenance tracking for concurrent program debugging & analysis ☆13 Nov 14, 2014 Updated 11 years ago
- Train your own object detection model with ATSS! Highly detailed tutorial, with a PDF version available for download ☆10 Jul 28, 2020 Updated 5 years ago
- Artifacts of EVT ASPLOS'24 ☆30 Mar 6, 2024 Updated 2 years ago
- OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library ☆29 Dec 9, 2019 Updated 6 years ago
- Documentation for the entire CGRAFlow ☆19 Sep 17, 2021 Updated 4 years ago
- A sandbox toy that experiments with solid-fluid coupling ☆11 Feb 17, 2025 Updated last year
- This is a Caffe implementation of the ShuffleNet model ☆15 Mar 15, 2018 Updated 8 years ago
- C/C++ header dependency list generator. Output can be used to create a dependency graph. ☆14 Apr 13, 2021 Updated 5 years ago
- A prototype implementation of AllReduce collective communication routine. ☆19 Sep 27, 2018 Updated 7 years ago
- A simple cycle-accurate DaDianNao simulator ☆13 Mar 27, 2019 Updated 7 years ago
- Convert a PyTorch-trained YOLO model to ncnn for flexible deployment ☆10 Aug 30, 2018 Updated 7 years ago
- Multi-Source Domain Adaptation via Optimal Transport for Student-Teacher Learning - UAI 2021 ☆22 Jan 26, 2025 Updated last year
- Domain Aggregation Networks ☆22 Jul 14, 2020 Updated 5 years ago
- log: a header-only logging library for tracking crashes and data ☆16 Dec 25, 2018 Updated 7 years ago
- Linux hot patching in practice ☆18 Jun 11, 2019 Updated 6 years ago
- To better understand the ggml library ☆27 Jun 13, 2025 Updated 11 months ago
- Training code of 'Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network'. https://arxiv.org/abs/1803.0783… ☆10 Aug 14, 2018 Updated 7 years ago
- A VS Code extension to ease log reading and analysis ☆11 Jan 23, 2024 Updated 2 years ago
- ☆13 Dec 4, 2018 Updated 7 years ago
- Evaluating efficiency of several types of convolutions ☆30 Jun 5, 2017 Updated 8 years ago
- MTCNN light + SORT tracking ☆44 Feb 24, 2020 Updated 6 years ago
- ☆29 May 2, 2019 Updated 7 years ago
- ☆14 Mar 26, 2020 Updated 6 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆33 Mar 15, 2021 Updated 5 years ago
- The latest models and approaches for predicting storage device failure ☆17 May 21, 2019 Updated 7 years ago
- linux-kernel with warpdrive ☆14 Jan 9, 2021 Updated 5 years ago