simonw / llm-mlx-llamaLinks
Run Llama 2 using MLX on macOS
☆34Updated 2 years ago
Alternatives and similar repositories for llm-mlx-llama
Users that are interested in llm-mlx-llama are comparing it to the libraries listed below
Sorting:
- Create an LLM XML context document from an llms.txt file☆23Updated last year
- Access the Cohere Command R family of models☆38Updated 9 months ago
- Jim is a simple, beautiful Jupyter notebook editor for macOS☆35Updated 2 years ago
- Verbosity control for AI agents☆65Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Updated 2 years ago
- Embedding models from Jina AI☆65Updated 2 years ago
- A cookiecutter template for building plugins for LLM☆29Updated last month
- A collection of tools for your LLMs that run on Modal☆23Updated 10 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆35Updated last year
- Run embedding models using ONNX☆35Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 9 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆218Updated 3 weeks ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆92Updated last week
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Updated last year
- ☆23Updated 7 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆19Updated last year
- Code interpreter support for o1☆31Updated last year
- ☆47Updated last year
- LLM plugin for clustering embeddings☆82Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Convert a web page to markdown☆80Updated last year
- ☆24Updated last year
- Plugin for LLM adding a Markov chain generating model☆20Updated last year
- Code Interpreter Replica☆26Updated 2 years ago
- ☆19Updated 2 years ago
- Very minimal (and stateless) agent framework☆44Updated last year
- Dialectical reasoning architecture for LLMs (Thesis → Antithesis → Synthesis)☆106Updated last week
- Inference examples☆65Updated 4 months ago