unit-mesh / edge-inferLinks
EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained devices like Android, iOS, or MCUs for real-time decision-making. EdgeInfer 旨在资源受限的设备上运行小型 AI 模型(包括向量化和 Onnx 模型),如 Android、iOS 或 MCUs,实现高效的边缘智能,用于实时决策。
☆45Updated last year
Alternatives and similar repositories for edge-infer
Users that are interested in edge-infer are comparing it to the libraries listed below
Sorting:
- Auto Thinking Mode switch for Qwen3 in Open webui☆66Updated 2 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 5 months ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated 2 months ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆122Updated this week
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications …☆113Updated last year
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆31Updated 10 months ago
- WIP. Apps (100+) + AI.☆30Updated 10 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- A transformer-based multimodal model for music.☆28Updated 11 months ago
- Proof of concept for running moshi/hibiki using webrtc☆20Updated 4 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆52Updated last month
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆27Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 9 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆21Updated 2 months ago
- support BM25+vecetor☆29Updated last month
- Various LLM Benchmarks☆24Updated last month
- Conversational Retrieval Evaluation Dataset☆101Updated 4 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆40Updated 4 months ago
- Rust implementation of Surya☆58Updated 4 months ago
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆21Updated 2 months ago
- ☆101Updated 10 months ago
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆53Updated last year
- ☆26Updated 10 months ago
- Jina DeepSearch UI☆117Updated 2 weeks ago
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆74Updated 8 months ago
- Eko demos☆28Updated this week
- Turn any OCR models into online inference API endpoint 🚀 🌖☆56Updated 3 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 9 months ago
- An abstraction library for building domain-specific intelligent agents based on Large Language Models (LLMs). LLMAgent provides a core ar…☆27Updated 3 months ago
- A voice assistant that runs completely on your local device.☆44Updated this week