unit-mesh / edge-inferLinks
EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained devices like Android, iOS, or MCUs for real-time decision-making. EdgeInfer 旨在资源受限的设备上运行小型 AI 模型(包括向量化和 Onnx 模型),如 Android、iOS 或 MCUs,实现高效的边缘智能,用于实时决策。
☆45Updated last year
Alternatives and similar repositories for edge-infer
Users that are interested in edge-infer are comparing it to the libraries listed below
Sorting:
- Auto Thinking Mode switch for Qwen3 in Open webui☆62Updated 3 weeks ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Updated last year
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated last month
- Various LLM Benchmarks☆20Updated last week
- coze api to openai☆14Updated 9 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 8 months ago
- 👷♂️Minion is Agent's Brain. Minion is designed to execute any type of queries, offering a variety of features that demonstrate its flex…☆19Updated this week
- A transformer-based multimodal model for music.☆28Updated 9 months ago
- ☆26Updated 9 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 4 months ago
- 将零一万物 YI-34B 模型 API 转换为各种使用 OpenAI API 的开源软件支持的格式,无需修改开源软件配置或代码。☆11Updated last year
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆34Updated 2 months ago
- support BM25+vecetor☆29Updated last week
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated 3 weeks ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆31Updated 9 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- Rust implementation of Surya☆58Updated 3 months ago
- A voice assistant that runs completely on your local device.☆19Updated 2 weeks ago
- A tiny 1000 line implementation of GraphRAG in Python☆71Updated 3 months ago
- Awesome Code Action - DeepWebSearch AgentKit App. Build with 🤗 Hugging Face smolagents framework☆62Updated last week
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆74Updated 7 months ago
- Run GreenBitAI's Quantized LLMs on Apple Devices with MLX☆25Updated 3 weeks ago
- AI-native application framework and runtime, simply write a YAML file.☆51Updated last year
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications …☆112Updated last year
- Jina DeepSearch UI☆111Updated this week
- ☆101Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆21Updated 7 months ago
- Real time faster whisper gradio☆26Updated 7 months ago