alexgusevski / anemll-serverLinks
An OpenAI API compatible FastAPI server that sits on top of the Anemll repo. Tested with Open WebUI.
☆17Updated 8 months ago
Alternatives and similar repositories for anemll-server
Users that are interested in anemll-server are comparing it to the libraries listed below
Sorting:
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆99Updated 5 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆129Updated 2 weeks ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆47Updated last month
- Distributed Inference for mlx LLm☆99Updated last year
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- Train Large Language Models on MLX.☆236Updated 2 weeks ago
- ☆43Updated last month
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.☆69Updated last month
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆120Updated last week
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆236Updated last month
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 4 months ago
- MLX-GUI MLX Inference Server for Apple Silicone☆157Updated last week
- ☆62Updated 5 months ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆82Updated last year
- Sparse Inferencing for transformer based LLMs☆215Updated 4 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆43Updated 6 months ago
- Find the hidden meaning of LLMs☆39Updated last month
- ☆126Updated 5 months ago
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu…☆26Updated last year
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx☆18Updated 9 months ago
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆29Updated this week
- Fast parallel LLM inference for MLX☆238Updated last year
- A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI…☆156Updated this week
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆122Updated last year
- Your personal and private AI☆52Updated 8 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51Updated 7 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆275Updated last month
- A web application that converts speech to speech 100% private☆81Updated 6 months ago
- ☆29Updated 8 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Updated 11 months ago