EdgeInfer enables efficient edge intelligence by running small AI models, including embedding models and ONNX models, on resource-constrained devices such as Android, iOS, or MCUs for real-time decision-making.
☆51 · Apr 17, 2024 · Updated 2 years ago
Alternatives and similar repositories for edge-infer
Users who are interested in edge-infer are comparing it to the libraries listed below.
- Boston - AI Assistant is an iOS, iPadOS, macOS, and visionOS application that uses SiriKit and OpenAI APIs to allow users to access Chat… ☆21 · Sep 12, 2025 · Updated 9 months ago
- Port of Helix MP3 code to ESP8266 ☆14 · Dec 8, 2017 · Updated 8 years ago
- OpenAI compatible API for open source LLMs ☆17 · Oct 30, 2023 · Updated 2 years ago
- An excel-like spreadsheet component for SQLPage ☆16 · Aug 4, 2025 · Updated 10 months ago
- Portable LLM - A rust library for LLM inference ☆12 · Apr 13, 2024 · Updated 2 years ago
- Simple Prompt Plugin is a plugin for Obsidian that allows you to generate content in your notes using LLMs. ☆14 · Jun 16, 2024 · Updated last year
- ☆13 · Nov 4, 2023 · Updated 2 years ago
- JAX bindings for the flash-attention3 kernels ☆23 · Jan 2, 2026 · Updated 5 months ago
- Utility to write images to SD cards under Android. Can patch A10 images ☆18 · Jan 27, 2013 · Updated 13 years ago
- A CUDA runtime environment based on the CUDA Driver API ☆16 · Jul 30, 2025 · Updated 10 months ago
- ☆35 · Oct 29, 2025 · Updated 7 months ago
- Using LLMs to manage files and generate metadata such as tags and summaries. ☆17 · Apr 11, 2025 · Updated last year
- Plugging LLMs into Android's Assistant API ☆14 · Jun 4, 2025 · Updated last year
- auto-rust is an experimental project that automatically generates Rust code with LLMs (Large Language Models) during compilation, utilizing… ☆51 · Nov 12, 2024 · Updated last year
- A Triton JIT runtime and ffi provider in C++ ☆35 · May 27, 2026 · Updated 2 weeks ago
- FastTrack4LLM is a learning and practice framework for people studying large language models, helping them grasp the core principles and training workflow of LLMs so that anyone can truly understand their inner workings. The project not only fully reproduces mainstream open-source LLM architectures such as LLaMA, Qwen, and DeepSeek, but also covers the full LLM lifecycle: To… ☆31 · Nov 6, 2025 · Updated 7 months ago
- implement llava using candle ☆15 · Jun 9, 2024 · Updated 2 years ago
- Gradient Themes For Plasma Desktop ☆14 · Dec 14, 2025 · Updated 6 months ago
- Actually Helpful Digital Assistant: let AI control your PC ☆14 · Mar 5, 2025 · Updated last year
- Developing a high-precision legal expert LLM application called Contract Advisor RAG. The project's goal is to create a Retrieval Augment… ☆16 · Apr 10, 2024 · Updated 2 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels ☆10 · Oct 11, 2020 · Updated 5 years ago
- Nvidia spoofer for AMD and Intel cards on Vulkan ☆13 · Apr 1, 2024 · Updated 2 years ago
- PrompFlower 1.0 is a command-line tool that generates AI prompts entirely locally and offline using a local AI engine via Ollama. ☆13 · Mar 18, 2025 · Updated last year
- A subset of the popular LibriTTS dataset with subsets for English, Scottish, Welsh, and Irish accents. ☆16 · Mar 17, 2023 · Updated 3 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆23 · Sep 26, 2024 · Updated last year
- The world's first sybil resistant, fully decentralized reputation protocol. ☆29 · Mar 7, 2019 · Updated 7 years ago
- A modern AI company website ☆26 · May 14, 2026 · Updated last month
- Improved IPC for Electron ☆12 · Nov 6, 2017 · Updated 8 years ago
- Persys desktop. Electron based application to access your Persys server. ☆16 · May 16, 2025 · Updated last year
- Your AI Copilot in Rust ☆51 · Dec 17, 2023 · Updated 2 years ago
- Supercharged pandas indexing ☆11 · Mar 28, 2021 · Updated 5 years ago
- Translate between related languages on your mobile device. ☆33 · Apr 28, 2026 · Updated last month
- Interface Driven Distributed Data Service ☆29 · Jan 17, 2025 · Updated last year
- Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content … ☆13 · Jan 24, 2024 · Updated 2 years ago
- Chat with any website on your local machine ☆85 · Jun 30, 2024 · Updated last year
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆65 · Oct 3, 2024 · Updated last year
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model. ☆33 · May 24, 2025 · Updated last year
- 📄 Project description, planning & documentation. ☆14 · May 4, 2026 · Updated last month
- MQTT broker ☆11 · Apr 2, 2026 · Updated 2 months ago