simonw / llm-mlx-llama
Run Llama 2 using MLX on macOS
☆33Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm-mlx-llama
- Run embedding models using ONNX☆23Updated 9 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆32Updated 7 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- ☆15Updated 10 months ago
- Server-side logic for an LLM application to make your prose clearer and more objective.☆25Updated 10 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆108Updated 2 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆57Updated 4 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated 6 months ago
- ☆30Updated last year
- SQL functions for calling OpenAI APIs☆21Updated last year
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆16Updated 2 months ago
- Apps that run on modal.com☆12Updated 5 months ago
- ☆26Updated 2 months ago
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore☆39Updated last week
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- Maintain a FAISS index for specified Datasette tables☆35Updated 5 months ago
- This repo lets you run mistral-7b in Google Colab.☆16Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Create an LLM XML context document from an llms.txt file☆13Updated 2 months ago
- ☆3Updated 3 months ago
- ☆10Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 11 months ago
- LMQL implementation of tree of thoughts☆33Updated 9 months ago
- Embedding models from Jina AI☆56Updated 10 months ago
- LLM plugin for embeddings using sentence-transformers☆43Updated 9 months ago
- Latent Large Language Models☆16Updated 2 months ago
- Plugin for LLM adding support for Google's PaLM 2 model☆14Updated last year