simonw / llm-mlx-llama
Run Llama 2 using MLX on macOS
☆34Updated last year
Alternatives and similar repositories for llm-mlx-llama:
Users that are interested in llm-mlx-llama are comparing it to the libraries listed below
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 8 months ago
- Training hybrid models for dummies.☆20Updated 3 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated last month
- Run embedding models using ONNX☆32Updated last year
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 7 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Create embeddings for LLM using the Nomic API☆23Updated 5 months ago
- MCP remote server for AI Engineer World's Fair 2025☆14Updated 2 weeks ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 7 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated last year
- Create an LLM XML context document from an llms.txt file☆18Updated 8 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆79Updated 9 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- ☆32Updated last year
- alternative way to calculating self attention☆18Updated 11 months ago
- ☆15Updated last year
- ☆18Updated last month
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 10 months ago
- Embedding models from Jina AI☆59Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Updated 10 months ago
- ☆4Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 3 months ago
- A daemon that makes a desktop OS accessible to AI agents☆26Updated 2 weeks ago
- SQL functions for calling OpenAI APIs☆21Updated 2 years ago
- A collection of tools for your LLMs that run on Modal☆18Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 8 months ago
- Apps that run on modal.com☆12Updated 11 months ago
- ☆36Updated 5 months ago