umitkacar / ai-edge-computing-tiny-embedded
☆14Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for ai-edge-computing-tiny-embedded
- ☆20Updated 2 years ago
- ☆12Updated 5 months ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆13Updated last year
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)☆27Updated 2 months ago
- Lottery Ticket Adaptation☆36Updated last month
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆17Updated this week
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 2 years ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆41Updated last month
- Integrate an LLM copilot within your Keras model development workflow☆28Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆27Updated last week
- Official implementation for the paper "Automatic Neural Network Pruning that Efficiently Preserves the Model Accuracy".☆9Updated last year
- Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)☆13Updated 4 months ago
- Export pytorch model to ONNX and convert ONNX from float32 to float 16☆10Updated last year
- ☆18Updated 3 years ago
- Code repo for the paper "AIO-P: Expanding Neural Performance Predictors Beyond Image Classification", accepted to AAAI-23.☆10Updated 5 months ago
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code.☆9Updated 2 years ago
- Awesome Quantization Paper lists with Codes☆12Updated 3 years ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆30Updated 2 years ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- [ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe☆20Updated 2 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆26Updated 5 months ago
- ☆12Updated 7 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆15Updated this week
- Open Source Projects from Pallas Lab☆19Updated 3 years ago
- ☆13Updated last year
- ☆12Updated 2 months ago
- [ICCV 2023] Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks☆20Updated last year
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆28Updated 4 months ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆13Updated last year