simonw / llm-mlx-llamaLinks

Run Llama 2 using MLX on macOS

☆34

Alternatives and similar repositories for llm-mlx-llama

Users that are interested in llm-mlx-llama are comparing it to the libraries listed below

Sorting:

AnswerDotAI / llm-ctx
Create an LLM XML context document from an llms.txt file
☆21Updated 11 months ago
simonw / llm-embed-jina
Embedding models from Jina AI
☆61Updated last year
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
replicate / hype
A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.
☆130Updated last month
FanaHOVA / openai-o1-code-interpreter
Code interpreter support for o1
☆32Updated 10 months ago
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45Updated last year
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆26Updated 8 months ago
simonw / llm-cluster
LLM plugin for clustering embeddings
☆77Updated last year
simonw / llm-anyscale-endpoints
LLM plugin for models hosted by Anyscale Endpoints
☆33Updated last year
g-aggarwal / mlx-hub
A python command-line tool to download & manage MLX AI models from Hugging Face.
☆18Updated 11 months ago
marimo-team / marimo-labs
☆20Updated last month
simonw / language-models-on-the-command-line
Handout for a talk I gave about LLM and CLI tools
☆63Updated last year
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 3 months ago
cfahlgren1 / hf-data-explorer
Chrome Extension for exploring Hugging Face datasets 🔎
☆50Updated 10 months ago
Alignment-Lab-AI / KnowledgeBase
never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…
☆37Updated last year
marimo-team / examples
A curated collection of example marimo notebooks — use these as templates for your own experiments, workflows, and tools.
☆46Updated this week
sacha-ichbiah / outlines-mlx
A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX
☆55Updated last year
simonw / llm-embed-onnx
Run embedding models using ONNX
☆34Updated last year
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66Updated last year
LucasSte / MLX-vs-Pytorch
Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs
☆88Updated last year
swyxio / openlangmem
☆47Updated last year
simonw / llm-command-r
Access the Cohere Command R family of models
☆37Updated 4 months ago
AnswerDotAI / web2md-ext
Get a markdown version of any webpage with a keyboard shortcut.
☆65Updated 5 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
irthomasthomas / undecidability
☆22Updated 11 months ago
simonw / llm-plugin
A cookiecutter template for building plugins for LLM
☆28Updated 3 months ago
mobarski / aidapter
Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)
☆20Updated last year
ivanleomk / modal-grpo
☆20Updated 4 months ago
mbusigin / yaml-runner
☆57Updated 2 years ago
philipp-eisen / modal-mcp-toolbox
A collection of tools for your LLMs that run on Modal
☆21Updated 5 months ago