unit-mesh / edge-inferLinks
EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained devices like Android, iOS, or MCUs for real-time decision-making. EdgeInfer 旨在资源受限的设备上运行小型 AI 模型(包括向量化和 Onnx 模型),如 Android、iOS 或 MCUs,实现高效的边缘智能,用于实时决策。
☆47Updated last year
Alternatives and similar repositories for edge-infer
Users that are interested in edge-infer are comparing it to the libraries listed below
Sorting:
- Auto Thinking Mode switch for Qwen3 in Open webui☆67Updated 3 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 6 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 11 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆53Updated 3 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 7 months ago
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications …☆114Updated last year
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆42Updated 6 months ago
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated 2 years ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆106Updated last month
- coze api to openai☆14Updated 11 months ago
- MiniCPM on iOS.☆68Updated 5 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- A Next.js version of Claude Aritfacts , inspired by llamacoder☆24Updated 11 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 11 months ago
- Rust implementation of Surya☆60Updated 5 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 3 months ago
- support BM25+vecetor☆29Updated 3 months ago
- Conversational Retrieval Evaluation Dataset☆101Updated last week
- CursorCore: Assist Programming through Aligning Anything☆131Updated 6 months ago
- ☆37Updated 3 weeks ago
- Chooat is an open-source project designed to provide a seamless and powerful AI chat experience.☆23Updated 7 months ago
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆54Updated last year
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆53Updated last year
- TiDB Vector SDK for Python, including code examples. Join our Discord: https://discord.gg/XzSW23Jg9p☆59Updated last month
- Website with current metrics on the fastest AI models.☆43Updated 9 months ago
- Library for model distillation☆150Updated 6 months ago
- xllamacpp - a Python wrapper of llama.cpp☆52Updated this week
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆82Updated 3 weeks ago
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆21Updated 3 months ago