nota-github / MLC-VLM-templateLinks
Mobile (i.e., Android, iOS) foundation model (i.e., LLM, VLM) deployed with MLC
☆24Updated 11 months ago
Alternatives and similar repositories for MLC-VLM-template
Users that are interested in MLC-VLM-template are comparing it to the libraries listed below
Sorting:
- This project is an implementation of fine-tuning the Gemma 2b-it model on a custom dataset and deploy the model on Android.☆63Updated last year
- ☆109Updated 5 months ago
- Summarize any Arixv Paper with ease☆66Updated 2 years ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆130Updated last year
- powerful and fast tool calling agents☆80Updated 10 months ago
- One click templates for inferencing Language Models☆228Updated 2 months ago
- This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for…☆20Updated last year
- ☆27Updated last year
- Embed anything.☆27Updated last year
- ⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed, efficiency, and scalability 🚀☆63Updated 2 weeks ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Awesome Mobile LLMs☆301Updated 2 months ago
- Self-host LLMs with vLLM and BentoML☆168Updated 2 weeks ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- An Open-Source Modular AI Assistant☆32Updated 10 months ago
- Own your AI, search the web with it🌐😎☆94Updated last year
- A collection of all available inference solutions for the LLMs☆94Updated 11 months ago
- Set of scripts to finetune LLMs☆38Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.☆63Updated last year
- ☆40Updated last year
- Maybe the new state of the art vision model? we'll see 🤷♂️☆171Updated 2 years ago
- An open source risk-management tool built for stock and security risk analysis☆38Updated 3 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆170Updated 9 months ago
- REAP: Router-weighted Expert Activation Pruning for SMoE compression☆232Updated last month
- Eh, simple and works.☆27Updated 2 years ago
- [ICLR-2025-SLLM Spotlight 🔥]MobiLlama : Small Language Model tailored for edge devices☆668Updated 8 months ago
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆140Updated this week
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆90Updated last year
- Local first human friendly agents toolkit for the browser and Nodejs☆45Updated this week