Support PyTorch model conversion with LiteRT.
☆1,044Jun 8, 2026Updated this week
Alternatives and similar repositories for litert-torch
Users that are interested in litert-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆2,511Updated this week
- A tool for converting ONNX files to LiteRT/TFLite/TensorFlow, PyTorch native code (nn.Module), TorchScript (.pt), state_dict (.pt), Expor…☆967Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆4,716Updated this week
- A modern model graph visualizer and debugger☆1,491Updated this week
- Pytorch to Keras/Tensorflow/TFLite conversion made intuitive☆344Mar 10, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Nov 30, 2023Updated 2 years ago
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆1,076Updated this week
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆879Mar 3, 2026Updated 3 months ago
- Scripts for converting Keras CV Stable Diffusion to tflite☆33Jun 6, 2024Updated 2 years ago
- Generative AI extensions for onnxruntime☆1,047Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,355Updated this week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,634Updated this week
- ☆339May 28, 2026Updated last week
- Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.☆28Feb 21, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple tool to change the INPUT and OUTPUT shape of ONNX.☆15Apr 1, 2025Updated last year
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,444Apr 30, 2026Updated last month
- Fast Multimodal LLM on Mobile Devices☆1,532Apr 30, 2026Updated last month
- convert a pytorch model to a model for edge device☆20Mar 15, 2023Updated 3 years ago
- Tensorflow Backend for ONNX☆1,322Mar 28, 2024Updated 2 years ago
- On-device Neural Engine☆570May 31, 2026Updated last week
- Remote source nodes for NNStreamer pipelines without GStreamer dependencies☆17Mar 11, 2026Updated 2 months ago
- ☆2,728Apr 9, 2026Updated 2 months ago
- Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digit…☆2,947Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆699Mar 29, 2024Updated 2 years ago
- ☆194Updated this week
- Backward compatible ML compute opset inspired by HLO/MHLO☆656May 27, 2026Updated 2 weeks ago
- MediaTek's TFLite delegate☆53Dec 8, 2025Updated 6 months ago
- Demonstration of running a native LLM on Android device.☆255Updated this week
- Simplify your onnx model☆4,346Jun 1, 2026Updated last week
- New operators for the ReferenceEvaluator, new kernels for onnxruntime, CPU, CUDA☆36Feb 13, 2026Updated 3 months ago
- ☆343Feb 12, 2026Updated 3 months ago
- ☆25Sep 19, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,652Updated this week
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆948Nov 27, 2024Updated last year
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- ONNX model visualizer☆88Jun 28, 2023Updated 2 years ago
- ☆13May 11, 2026Updated last month
- TinyChatEngine: On-Device LLM Inference Library☆953Jul 4, 2024Updated last year
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆305Apr 22, 2024Updated 2 years ago