umitkacar / ai-edge-computing-tiny-embeddedLinks
☆18Updated last year
Alternatives and similar repositories for ai-edge-computing-tiny-embedded
Users that are interested in ai-edge-computing-tiny-embedded are comparing it to the libraries listed below
Sorting:
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆13Updated last year
- Bag of MLP☆20Updated 4 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆15Updated last year
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14Updated last year
- Converts CLIP models to ONNX☆11Updated 2 years ago
- Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"☆87Updated 5 months ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated last year
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆28Updated 4 years ago
- Knock your images before you get stressed.☆10Updated 3 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆30Updated 3 years ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- Official implementation "ChipNet: Budget-Aware Pruning with Heaviside Continuous Approximations"☆20Updated 2 years ago
- Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)☆18Updated 2 years ago
- Segment Anything Model (SAM) interactive demo with OpenVINO☆12Updated last year
- ☆21Updated 2 years ago
- KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆12Updated last month
- ☆19Updated 3 years ago
- LDC: Lightweight Dense CNN for Edge DetectionのPythonでのONNX推論サンプル☆15Updated 2 years ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆63Updated 9 months ago
- PyTorch Implementation of Backbone of PicoDet☆8Updated 3 years ago
- Code for Federated Neuromorphic Learning of Spiking Neural Networks for Low-Power Edge Intelligence☆15Updated 4 years ago
- Demonstrates how to divide a DL model into multiple IR model files (division) and introduce a simplest way to implement a custom layer wo…☆12Updated 3 years ago
- Export pytorch model to ONNX and convert ONNX from float32 to float 16☆11Updated 2 years ago
- Sample implementation of 3D object detection with Intel OpenVINO☆15Updated 4 years ago
- A Python tool to solve logic games with AI, Deep Learning and Computer Vision☆16Updated 4 years ago
- Simple tool to combine(merge) onnx models. Simple Network Combine Tool for ONNX.☆17Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆39Updated 2 years ago
- About DNN compression and acceleration on Edge Devices.☆55Updated 4 years ago