google-ai-edge / LiteRTLinks

LiteRT continues the legacy of TensorFlow Lite as the trusted, high-performance runtime for on-device AI. Now with LiteRT Next, we're expanding our vision with a new generation of APIs designed for superior performance and simplified hardware acceleration. Discover what's next for on-device AI.

☆688

Alternatives and similar repositories for LiteRT

Users that are interested in LiteRT are comparing it to the libraries listed below

Sorting:

google-ai-edge / ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
☆732Updated this week
quic / ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.)…
☆756Updated this week
quic / ai-hub-apps
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…
☆252Updated this week
pytorch / executorch
On-device AI across mobile, embedded and edge for PyTorch
☆3,111Updated this week
openvinotoolkit / openvino.genai
Run Generative AI models with simple C++/Python API and using OpenVINO Runtime
☆316Updated this week
google-ai-edge / ai-edge-quantizer
AI Edge Quantizer: flexible post training quantization for LiteRT models.
☆56Updated last week
microsoft / onnxruntime-genai
Generative AI extensions for onnxruntime
☆783Updated this week
intel / intel-npu-acceleration-library
Intel® NPU Acceleration Library
☆680Updated 3 months ago
ARM-software / kleidiai
This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai
☆62Updated 2 weeks ago
tensorflow / tflite-support
TFLite Support is a toolkit that helps users to develop ML and deploy TFLite models onto mobile / ioT devices.
☆416Updated 2 weeks ago
google-ai-edge / LiteRT-LM
☆290Updated this week
moonshine-ai / useful-transformers
Efficient Inference of Transformer models
☆443Updated last year
microsoft / onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime
☆404Updated last week
mit-han-lab / TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
☆879Updated last year
quic / qidk
☆149Updated last month
microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆369Updated this week
stevelaskaridis / awesome-mobile-llm
Awesome Mobile LLMs
☆226Updated last week
huggingface / optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
☆481Updated this week
onnx / turnkeyml
No-code CLI designed for accelerating ONNX workflows
☆205Updated last month
PINTO0309 / onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…
☆836Updated 2 weeks ago
huggingface / optimum-executorch
🤗 Optimum ExecuTorch
☆58Updated this week
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆289Updated last year
google-ai-edge / model-explorer
A modern model graph visualizer and debugger
☆1,293Updated this week
google / XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
☆2,079Updated this week
microsoft / onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
☆1,441Updated last week
SonySemiconductorSolutions / mct-model-optimization
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…
☆409Updated 3 weeks ago
NVIDIA / cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
☆598Updated 3 weeks ago
NVIDIA / TensorRT-Model-Optimizer
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. …
☆1,093Updated this week
google-ai-edge / ai-edge-apis
☆122Updated this week
kraiskil / onnx2c
Open Neural Network Exchange to C compiler.
☆303Updated last month