alexgusevski / anemll-serverLinks
An OpenAI API compatible FastAPI server that sits on top of the Anemll repo. Tested with Open WebUI.
☆12Updated 2 months ago
Alternatives and similar repositories for anemll-server
Users that are interested in anemll-server are comparing it to the libraries listed below
Sorting:
- ☆25Updated 2 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆84Updated 5 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆76Updated last month
- Train Large Language Models on MLX.☆77Updated this week
- ☆28Updated last month
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆30Updated 2 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆165Updated last week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆42Updated 9 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆30Updated 3 months ago
- ☆90Updated 5 months ago
- I'll be your machinery.☆14Updated last month
- ModernBERT model optimized for Apple Neural Engine.☆26Updated 4 months ago
- ☆75Updated 2 weeks ago
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx☆17Updated 2 months ago
- Thoughtful Lightning AI Assistant - Dual-engine system with DeepSeek reasoning and Groq inference, featuring Gradio UI, secure API manage…☆20Updated 4 months ago
- For inferring and serving local LLMs using the MLX framework☆104Updated last year
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆34Updated 5 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 7 months ago
- Distributed Inference for mlx LLm☆92Updated 10 months ago
- ☆114Updated 5 months ago
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu…☆26Updated 9 months ago
- entropix style sampling + GUI☆26Updated 7 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Updated last week
- Letting Claude Code develop his own MCP tools :)☆105Updated 2 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆399Updated last week
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.☆38Updated 3 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 5 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆24Updated 3 weeks ago