ziozzang / Mac_mlx_phi-2_server
Test server code for Phi-2 model. support OpenAI API spec
☆16Updated last year
Alternatives and similar repositories for Mac_mlx_phi-2_server:
Users that are interested in Mac_mlx_phi-2_server are comparing it to the libraries listed below
- a version of baby agi using dspy and typed predictors☆17Updated 10 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆46Updated 8 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆68Updated 6 months ago
- ☆15Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆15Updated 2 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆22Updated 6 months ago
- LMQL implementation of tree of thoughts☆33Updated 11 months ago
- Routing on Random Forest (RoRF)☆98Updated 3 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆110Updated this week
- Run large models from the terminal using Apple MLX.☆27Updated 10 months ago
- A QT GUI for large language models☆27Updated last year
- High level tool use for LLMs☆34Updated 5 months ago
- An experimental and alternative approach to Finetuning and RAG.☆35Updated last year
- A function to do all☆35Updated 9 months ago
- Developer showcase of projects built on Cartesia☆16Updated 4 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆67Updated last month
- CLI tool for text to image generation using the FLUX.1 model.☆49Updated last month
- Cog wrapper for collabora/WhisperSpeech☆25Updated 10 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆58Updated 6 months ago
- Access the Cohere Command R family of models☆34Updated 9 months ago
- Scripts to create your own moe models using mlx☆85Updated 10 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆32Updated 8 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆26Updated 3 months ago
- Very basic framework for parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture …☆36Updated last week
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 6 months ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆24Updated last year
- LLM code editor for backend services☆13Updated 3 months ago
- A simple swift app for MacOS/iOS to test large language models (LLM)☆25Updated last year
- ☆38Updated 10 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago