itsmostafa / inference-speed-tests
Local LLM inference speed tests on various devices
☆115Updated 3 weeks ago
Alternatives and similar repositories for inference-speed-tests
Users who are interested in inference-speed-tests are comparing it to the libraries listed below.
- Optimized Ollama LLM server configuration for Mac Studio and other Apple Silicon Macs. Headless setup with automatic startup, resource op…☆279Updated last week
- Your gateway to both Ollama & Apple MLX models☆149Updated 10 months ago
- macOS menu‑bar utility to adjust Apple Silicon GPU VRAM allocation☆247Updated 9 months ago
- Accessing Apple Intelligence and ChatGPT desktop through OpenAI / Ollama API☆335Updated 5 months ago
- Local Apple Notes + LLM Chat☆97Updated last month
- A macOS AppleScript MCP server☆354Updated 9 months ago
- Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. …☆511Updated 3 weeks ago
- A wannabe Ollama equivalent for Apple MLX models☆81Updated 11 months ago
- High-performance MLX-based LLM inference engine for macOS with native Swift implementation☆476Updated 2 weeks ago
- Local image and music generation for Apple Silicon☆72Updated 10 months ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆656Updated last month
- This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support☆226Updated 5 months ago
- MCP server that executes AppleScript, giving you full control of your Mac☆413Updated 2 months ago
- Link your Ollama models to LM-Studio☆150Updated last year
- CoexistAI is a modular, developer-friendly research assistant framework. It enables you to build, search, summarize, and automate resear…☆418Updated 3 months ago
- AI agent that controls computer with OS-level tools, MCP compatible, works with any model☆132Updated 5 months ago
- macOS whisper dictation app☆599Updated this week
- Qwen Image models through MPS☆256Updated last month
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆801Updated 3 weeks ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆259Updated 3 months ago
- ☆200Updated 10 months ago
- The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.☆1,182Updated this week
- A native desktop app for Open WebUI built with Tauri v2☆82Updated this week
- Ollama desktop client for everyday use☆89Updated 8 months ago
- End-to-end documentation to set up your own local & fully private LLM server on Debian. Equipped with chat, web search, RAG, model manage…☆668Updated 3 months ago
- Open-source hybrid assistant using small models (2B - 5B) and Gemini, with image and agentic tool capabilities and integration of RAG…☆227Updated 4 months ago
- A collection of 2025 agentic workflows built in n8n. Showcases manual multi-model orchestration, RAG-to-SQL, and autonomous research pipe…☆79Updated this week
- Welcome!☆141Updated last year
- 'afm' command cli: macOS server and single prompt mode that exposes Apple's Foundation Models through OpenAI-compatible API endpoints. Su…☆103Updated this week
- A beautiful local-first coding agent running in your terminal - built by the community for the community ⚒☆1,244Updated this week