remixer-dec / llama-mps
Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2
☆86Updated last year
Alternatives and similar repositories for llama-mps
Users that are interested in llama-mps are comparing it to the libraries listed below
Sorting:
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- LLM plugin for running models using MLC☆186Updated last year
- Tiny inference-only implementation of LLaMA☆93Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆324Updated 2 years ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Vector search dictionary definitions☆44Updated 2 years ago
- WebGPU LLM inference tuned by hand☆149Updated last year
- GPT-3 on your command line☆132Updated last year
- Embedding models from Jina AI☆59Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct☆76Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆233Updated 2 years ago
- Array-Inspired Pipeline Language☆119Updated last year
- Visualize text embeddings☆39Updated last year
- Praetor is a lightweight finetuning data and prompt management tool☆68Updated 5 months ago
- Mapping the French Culinary Universe☆48Updated 2 months ago
- Dead Simple LLM Abliteration☆214Updated 2 months ago
- For inferring and serving local LLMs using the MLX framework☆103Updated last year
- a curated list of data for reasoning ai☆136Updated 9 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- Demos utilizing the ChatGPT API☆95Updated 2 years ago
- An AI-driven tool to analyze your profile and gain insights into how ChatGPT interprets your personality.☆181Updated 2 years ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆153Updated last year
- An implementation of bucketMul LLM inference☆217Updated 10 months ago
- A web-app to explore topics using LLM (less typing and more clicks)☆66Updated last year
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated 2 years ago
- ☆163Updated 11 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 11 months ago
- Your friendly terminal-based AI pair programmer☆42Updated last year
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆123Updated 2 months ago