unit-mesh / edge-infer
EdgeInfer enables efficient edge intelligence by running small AI models, including embedding and ONNX models, on resource-constrained devices such as Android, iOS, or MCUs for real-time decision-making.
☆49 Updated last year
Alternatives and similar repositories for edge-infer
Users who are interested in edge-infer are comparing it to the libraries listed below.
- Auto Thinking Mode switch for Qwen3 in Open WebUI ☆70 Updated 7 months ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow ☆32 Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities. ☆107 Updated 5 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware ☆77 Updated last year
- MiniCPM on iOS. ☆67 Updated 9 months ago
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications … ☆115 Updated last year
- Jina DeepSearch UI ☆127 Updated 4 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task… ☆33 Updated 10 months ago
- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework. ☆29 Updated last year
- Rust implementation of Surya ☆63 Updated 10 months ago
- TiDB Vector SDK for Python, including code examples. Join our Discord: https://discord.gg/XzSW23Jg9p ☆60 Updated 5 months ago
- Supports BM25 + vector ☆29 Updated 7 months ago
- WIP. Apps (100+) + AI. ☆31 Updated last year
- ☆101 Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere… ☆22 Updated 7 months ago
- Chooat is an open-source project designed to provide a seamless and powerful AI chat experience. ☆22 Updated 11 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct… ☆42 Updated 10 months ago
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,… ☆54 Updated last year
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.) ☆58 Updated 7 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts ☆45 Updated 11 months ago
- g1: Using GPT-4o to create o1-like reasoning chains ☆20 Updated last year
- Website with current metrics on the fastest AI models. ☆42 Updated last year
- Have a natural voice conversation with an LLM ☆260 Updated 2 months ago
- xllamacpp - a Python wrapper of llama.cpp ☆68 Updated this week
- Ask shortgpt for instant and concise answers ☆12 Updated 2 years ago
- Various LLM Benchmarks ☆24 Updated 2 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆13 Updated last year
- Easy to deploy. A cloud service providing a Python code interpreter sandbox for Code-Interpreter. ☆57 Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots ☆28 Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Apple ☆19 Updated last year