nota-github / MLC-VLM-templateLinks
Mobile (i.e., Android, iOS) foundation model (i.e., LLM, VLM) deployed with MLC
☆16Updated 3 months ago
Alternatives and similar repositories for MLC-VLM-template
Users that are interested in MLC-VLM-template are comparing it to the libraries listed below
Sorting:
- Running Microsoft's BitNet via Electron, React & Astro☆39Updated this week
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆11Updated last week
- Passively collect images for computer vision datasets on the edge.☆33Updated last year
- A minimal Model Context Protocol 🖥️ server/client🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆20Updated last month
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year
- Self-host LLMs with LMDeploy and BentoML☆19Updated 2 months ago
- ☆46Updated 2 months ago
- AI Assistant running within your browser.☆67Updated 6 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆23Updated 2 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆25Updated last month
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆17Updated 3 weeks ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆17Updated last year
- ☆17Updated 11 months ago
- Self-hosted AI medical scribe.☆32Updated this week
- 👁️ Multimodal LLM vision multitool☆27Updated 7 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 6 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 5 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 8 months ago
- Real-Time Open-Vocabulary Object Detection☆13Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆15Updated 9 months ago
- This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for…☆19Updated 7 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆36Updated 3 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated last week
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.☆24Updated last year
- ☆130Updated 9 months ago
- ☆18Updated 6 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆16Updated last year