raullenchai / Rapid-MLX
The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.
88 · Mar 23, 2026 · Updated this week

Alternatives and similar repositories for Rapid-MLX

Users that are interested in Rapid-MLX are comparing it to the libraries listed below.
