Support PyTorch model conversion with LiteRT.
☆1,007Apr 24, 2026Updated this week
Alternatives and similar repositories for litert-torch
Users that are interested in litert-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆2,255Apr 23, 2026Updated last week
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆126Updated this week
- A tool for converting ONNX files to LiteRT/TFLite/TensorFlow, PyTorch native code (nn.Module), TorchScript (.pt), state_dict (.pt), Expor…☆951Apr 1, 2026Updated 3 weeks ago
- On-device AI across mobile, embedded and edge for PyTorch☆4,547Updated this week
- A modern model graph visualizer and debugger☆1,449Apr 21, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Pytorch to Keras/Tensorflow/TFLite conversion made intuitive☆345Mar 10, 2025Updated last year
- ☆16Nov 30, 2023Updated 2 years ago
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆1,009Updated this week
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆877Mar 3, 2026Updated last month
- Scripts for converting Keras CV Stable Diffusion to tflite☆33Jun 6, 2024Updated last year
- Generative AI extensions for onnxruntime☆1,014Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,326Updated this week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,604Updated this week
- ☆287Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.☆28Feb 21, 2023Updated 3 years ago
- Simple tool to change the INPUT and OUTPUT shape of ONNX.☆15Apr 1, 2025Updated last year
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,428Apr 21, 2025Updated last year
- Fast Multimodal LLM on Mobile Devices☆1,477Apr 12, 2026Updated 2 weeks ago
- convert a pytorch model to a model for edge device☆20Mar 15, 2023Updated 3 years ago
- Tensorflow Backend for ONNX☆1,324Mar 28, 2024Updated 2 years ago
- On-device Neural Engine☆562Updated this week
- ☆2,678Apr 9, 2026Updated 3 weeks ago
- Remote source nodes for NNStreamer pipelines without GStreamer dependencies☆17Mar 11, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MediaTek's TFLite delegate☆52Dec 8, 2025Updated 4 months ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆688Mar 29, 2024Updated 2 years ago
- ☆188Updated this week
- Backward compatible ML compute opset inspired by HLO/MHLO☆645Apr 22, 2026Updated last week
- Demonstration of running a native LLM on Android device.☆246Apr 12, 2026Updated 2 weeks ago
- Simplify your onnx model☆4,328Updated this week
- New operators for the ReferenceEvaluator, new kernels for onnxruntime, CPU, CUDA☆36Feb 13, 2026Updated 2 months ago
- ☆343Feb 12, 2026Updated 2 months ago
- Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digit…☆2,879Apr 22, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,623Apr 23, 2026Updated last week
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆942Nov 27, 2024Updated last year
- ONNX model visualizer☆88Jun 28, 2023Updated 2 years ago
- ☆13Apr 13, 2026Updated 2 weeks ago
- TinyChatEngine: On-Device LLM Inference Library☆953Jul 4, 2024Updated last year
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆304Apr 22, 2024Updated 2 years ago
- This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025☆1,510Apr 15, 2026Updated 2 weeks ago