google-ai-edge / LiteRT-LM
☆797Updated this week
Alternatives and similar repositories for LiteRT-LM
Users who are interested in LiteRT-LM are comparing it to the libraries listed below.
- LiteRT, successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆1,444Updated this week
- This repository hosts the official project files of Team J.E.E.P. from the Philippines for the FE competition in WRO 2025, detailing thei…☆300Nov 28, 2025Updated 2 months ago
- Support PyTorch model conversion with LiteRT.☆935Feb 7, 2026Updated last week
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 3 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆1,605Updated this week
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in C now. Fast and clean from the start this time :)☆17Feb 5, 2026Updated last week
- Agentic Research and Evaluation Suite☆71Updated this week
- Python Vietnamese word segmenter☆10Nov 14, 2019Updated 6 years ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Jan 5, 2026Updated last month
- Custom template for checking the availability of an entity.☆13Oct 31, 2025Updated 3 months ago
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆17Jan 19, 2026Updated 3 weeks ago
- The original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phones, https://github.com/ggml…☆35Jul 14, 2025Updated 6 months ago
- AI debugger and AI coder integrated. Use AI to code and drive the runtime debugger☆81Nov 25, 2025Updated 2 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 3 months ago
- Hands-on tutorial for trying out ChatGPT through a LINE bot☆18Jun 28, 2023Updated 2 years ago
- Let's sudo using Windows Hello face recognition on Windows Subsystem for Linux (WSL). It runs on both WSL 1 and WSL 2. This is a PAM modu…☆20Mar 22, 2025Updated 10 months ago
- LLamaHTML is a simple HTML file for communicating with a running llama.cpp llama-server☆22Aug 5, 2025Updated 6 months ago
- A curated list of awesome maker resources for learning programming, making, hardware, and software.☆11Apr 22, 2018Updated 7 years ago
- A chat UI for Llama.cpp☆15Dec 2, 2025Updated 2 months ago
- A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.☆15,102Feb 4, 2026Updated last week
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆33Oct 3, 2024Updated last year
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆798Feb 4, 2026Updated last week
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- NextPlaid, ColGREP: Multi-vector search, from database to coding agents.☆108Updated this week
- ☆15Feb 16, 2025Updated 11 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 5, 2026Updated last week
- Efficient non-uniform quantization with GPTQ for GGUF☆58Sep 17, 2025Updated 4 months ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,255Updated this week
- Official inference framework for 1-bit LLMs☆28,054Feb 3, 2026Updated last week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆2,135Updated this week
- A recursive coding agent inspired by RLMs☆93Updated this week
- ☆16May 31, 2024Updated last year
- ☆16Apr 30, 2024Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year
- A C++ framework for efficient training and fine-tuning of LLMs☆27Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆4,258Updated this week
- Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Releva…☆39Sep 22, 2025Updated 4 months ago
- ☆33Dec 8, 2024Updated last year
- Inference RWKV v7 in pure C.☆44Oct 10, 2025Updated 4 months ago