a computing kernel implementation in ML inference framework aiming at theoretical limit
☆12Dec 18, 2019Updated 6 years ago
Alternatives and similar repositories for speedup-aarch64-cpu
Users that are interested in speedup-aarch64-cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A toy compiler for subset of c++ written in python☆16Jan 17, 2025Updated last year
- ☆14Feb 26, 2026Updated 3 months ago
- A test case for evaluating the performance of the workgroup reduction operation in OpenCL 2.0☆10Nov 26, 2020Updated 5 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- A demo project for a computation graph implementation in C++.☆11Jul 2, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Simple task scheduler☆11May 16, 2023Updated 3 years ago
- A general performance test framework for Distributed File System☆15Oct 8, 2020Updated 5 years ago
- C++ deepsort on tensorflow☆18Apr 4, 2020Updated 6 years ago
- NVM user-space Primitives API library repository☆18Mar 12, 2014Updated 12 years ago
- Old academic project for my PhD - no longer maintained by me: fast gaussian and derivative convolutional filters☆10May 1, 2019Updated 7 years ago
- TensorFlow2.0 implementation FastFCN - https://arxiv.org/pdf/1903.11816v1.pdf☆11Aug 6, 2019Updated 6 years ago
- A Highlevel Python Wrapper for Vulkan's Compute API☆18Apr 13, 2026Updated 2 months ago
- Clone of https://code.google.com/p/google-coredumper/ with enhancements by Amadeus☆13Jul 2, 2024Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- C++ lock-free queue.☆14Jun 24, 2020Updated 5 years ago
- ☆20Oct 3, 2023Updated 2 years ago
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library☆29Dec 9, 2019Updated 6 years ago
- disk prediction papers☆19Oct 24, 2020Updated 5 years ago
- LLVM passes with usage instructions☆18Apr 23, 2017Updated 9 years ago
- This is a caffe implementation of ShuffleNet model☆15Mar 15, 2018Updated 8 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- convert pytorch trained yolo model to ncnn for Flexible deployment☆10Aug 30, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A simple, extensible build system for use with Scons☆24Apr 30, 2026Updated last month
- Basic algorithms for vslam.☆54Nov 20, 2020Updated 5 years ago
- Linux热补丁实践☆18Jun 11, 2019Updated 7 years ago
- ☆19Jun 10, 2026Updated last week
- ☆13Dec 4, 2018Updated 7 years ago
- Evaluating efficiency of several types of convolutions☆30Jun 5, 2017Updated 9 years ago
- MTCNN light + SORT tracking☆44Feb 24, 2020Updated 6 years ago
- A cross-platform C function to allocate aligned memory☆22Oct 14, 2019Updated 6 years ago
- ☆29May 2, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Mar 26, 2020Updated 6 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆33Mar 15, 2021Updated 5 years ago
- A Lossless Compression Algorithm☆13Jan 21, 2018Updated 8 years ago
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- ☆11Nov 6, 2019Updated 6 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- Regress Face Attributes with MobileNetV2☆44Dec 13, 2019Updated 6 years ago