umitkacar / ai-edge-computing-tiny-embeddedLinks
☆19Updated last year
Alternatives and similar repositories for ai-edge-computing-tiny-embedded
Users that are interested in ai-edge-computing-tiny-embedded are comparing it to the libraries listed below
Sorting:
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated last year
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Updated last year
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆67Updated 10 months ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆16Updated last year
- KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆18Updated 3 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- Converts CLIP models to ONNX☆11Updated 2 years ago
- Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.☆27Updated 2 years ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆61Updated 3 months ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14Updated last year
- Integrate an LLM copilot within your Keras model development workflow☆28Updated last year
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆30Updated 3 years ago
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆34Updated 3 years ago
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"☆90Updated 7 months ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆13Updated last year
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆46Updated 10 months ago
- Model compression for ONNX☆97Updated 8 months ago
- A simple Python tool to measure the performance of ONNX models.☆27Updated 11 months ago
- Build TensorFlow Lite runtime with GitHub Actions☆25Updated 3 weeks ago
- Implementation of transformers based architecture in PyTorch.☆54Updated 4 years ago
- Simple tool to combine(merge) onnx models. Simple Network Combine Tool for ONNX.☆18Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆57Updated last year
- Code of the paper "Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analys…☆25Updated last year
- How to export PyTorch models with unsupported layers to ONNX and then to Intel OpenVINO☆27Updated 5 months ago
- ☆11Updated 4 years ago
- Efficient Multi-Object Tracking for Edge devices☆13Updated last year
- Inference TinyLlama models on ncnn☆24Updated 2 years ago
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆52Updated 2 years ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆58Updated last week