songqun/speedup-aarch64-cpu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/songqun/speedup-aarch64-cpu)

songqun / speedup-aarch64-cpu

a computing kernel implementation in ML inference framework aiming at theoretical limit

☆12

Alternatives and similar repositories for speedup-aarch64-cpu

Users that are interested in speedup-aarch64-cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

akkaze / compiler
View on GitHub
A toy compiler for subset of c++ written in python
☆16Jan 17, 2025Updated last year
antiagainst / SM-G991U
View on GitHub
Kernel code for Samsung Galaxy S21 (Snapdragon 888)
☆20Jul 4, 2021Updated 5 years ago
vacancy / NaiveCompGraph
View on GitHub
A demo project for a computation graph implementation in C++.
☆11Jul 2, 2019Updated 7 years ago
jimmy-evo / opencl_kernels
View on GitHub
An easy way to run, test, benchmark and tune OpenCL kernel files
☆24Aug 25, 2023Updated 2 years ago
itemhsu / DeepSORT
View on GitHub
C++ deepsort on tensorflow
☆18Apr 4, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
mk-fg / cgroup-tools
View on GitHub
A set of tools to work with cgroup tree and process classification/QoS according to it
☆10Oct 1, 2019Updated 6 years ago
fhboswell / llvm-analysis-and-transform-passes
View on GitHub
LLVM passes with usage instructions
☆18Apr 23, 2017Updated 9 years ago
srihari-humbarwadi / FastFCN_TF2.0
View on GitHub
TensorFlow2.0 implementation FastFCN - https://arxiv.org/pdf/1903.11816v1.pdf
☆11Aug 6, 2019Updated 6 years ago
dongzhiyan-stack / async_memory_reclaim_for_cold_file_area
View on GitHub
linux内核异步内存回收的另一个思路：基于冷热文件的冷热区域精准的回收冷文件页page(可做成内核ko)
☆13Jun 14, 2024Updated 2 years ago
Gavinxyj / AVAnalysisTools
View on GitHub
音视频分析工具
☆12May 10, 2017Updated 9 years ago
osanj / lava
View on GitHub
A Highlevel Python Wrapper for Vulkan's Compute API
☆18Apr 13, 2026Updated 3 months ago
Black-Phoenix / CUDA-SfM
View on GitHub
A structure from motion implemention in C++ and accelerated using CUDA
☆48Oct 12, 2019Updated 6 years ago
AmadeusITGroup / CoreDumper
View on GitHub
Clone of https://code.google.com/p/google-coredumper/ with enhancements by Amadeus
☆13Jul 2, 2024Updated 2 years ago
UbiquitousLearning / Mandheling-DSP-Training
View on GitHub
The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]
☆20Aug 4, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
cameron314 / atomic_queue
View on GitHub
C++ lock-free queue.
☆14Jun 24, 2020Updated 6 years ago
enp1s0 / cutf
View on GitHub
CUDA Template Functions
☆20Dec 16, 2025Updated 7 months ago
mz24cn / gemm_optimization
View on GitHub
The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Inte…
☆17Mar 28, 2019Updated 7 years ago
MingSun-Tse / Caffe_IncReg
View on GitHub
[IJCNN'19, IEEE JSTSP'19] Caffe code for our paper "Structured Pruning for Efficient ConvNets via Incremental Regularization"; [BMVC'18] …
☆14Feb 14, 2020Updated 6 years ago
apuaaChen / EVT_AE
View on GitHub
Artifacts of EVT ASPLOS'24
☆30Mar 6, 2024Updated 2 years ago
nnormandin / YellowFin_Keras
View on GitHub
Modified version of the YellowFin optimizer for TensorFlow to work with the Keras API [not actively maintained]
☆16Jul 28, 2017Updated 9 years ago
SNU-ARC / OpenDNN
View on GitHub
OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library
☆29Dec 9, 2019Updated 6 years ago
StanfordAHA / CGRAFlowDoc
View on GitHub
Documentation for the entire CGRAFlow
☆19Sep 17, 2021Updated 4 years ago
hellogcc / OSDT2019
View on GitHub
OSDT2019相关资料
☆16Nov 17, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hed0rah / fs_monitoring
View on GitHub
dnotify,inotify, and fanotify example code from http://www.lanedo.com/filesystem-monitoring-linux-kernel/
☆14Apr 28, 2017Updated 9 years ago
HeinrichHartmann / zmqdump
View on GitHub
dump zmq messages on a socket
☆15Aug 27, 2023Updated 2 years ago
IronySuzumiya / NiuDianNao
View on GitHub
A simple cycle-accurate DaDianNao simulator
☆13Mar 27, 2019Updated 7 years ago
niushuqing123 / final-project
View on GitHub
一个尝试固液耦合的沙盒玩具
☆11Feb 17, 2025Updated last year
aliyun / aliyun-linkvisual-edge-linkai
View on GitHub
☆14Dec 8, 2022Updated 3 years ago
abeardear / ncnn-yolo
View on GitHub
convert pytorch trained yolo model to ncnn for Flexible deployment
☆10Aug 30, 2018Updated 7 years ago
dmrauch / vscode-cpp-remote-debug
View on GitHub
Project for testing remote debugging of C++ code with gdb and gdbserver in VS Code
☆20Jun 6, 2018Updated 8 years ago
Leedehai / C-include-2-dot
View on GitHub
C/C++ header dependency list generator. Output can be used to create a dependency graph.
☆15Apr 13, 2021Updated 5 years ago
cilkplus / cilkplus.github.com
View on GitHub
☆16Aug 11, 2016Updated 9 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
opennvm / nvm-primitives
View on GitHub
NVM user-space Primitives API library repository
☆18Mar 12, 2014Updated 12 years ago
ydsf16 / vslam
View on GitHub
Basic algorithms for vslam.
☆54Nov 20, 2020Updated 5 years ago
long123king / PE-Replay
View on GitHub
This is a pintool that can analyze target dynamically and output code blocks and "key frames".
☆14Mar 26, 2015Updated 11 years ago
viscloud / saf
View on GitHub
SAF: Streaming Analytics Framework
☆31Mar 6, 2019Updated 7 years ago
JiangTingjia / GL4A_logger
View on GitHub
log, 仅包含头文件，追踪崩溃和数据的日志库
☆16Dec 25, 2018Updated 7 years ago
L1B0 / linux-hotpatch
View on GitHub
Linux热补丁实践
☆18Jun 11, 2019Updated 7 years ago
amri369 / Pytorch-Iternet
View on GitHub
☆19Jul 17, 2026Updated last week