remixer-dec / llama-mpsLinks
Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2
☆86Updated last year
Alternatives and similar repositories for llama-mps
Users that are interested in llama-mps are comparing it to the libraries listed below
Sorting:
- LLM plugin for running models using MLC☆188Updated last year
- Tiny inference-only implementation of LLaMA☆93Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆324Updated 2 years ago
- Enable decision-making based on simulations☆227Updated last year
- GPT-3 on your command line☆131Updated 2 years ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆152Updated last year
- Augment GPT-4 Environment Access☆285Updated 2 years ago
- Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) wor…☆213Updated last year
- Supercharge Open-Source AI Models☆350Updated 2 years ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆137Updated last year
- Enforce structured output from LLMs 100% of the time☆249Updated 11 months ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆118Updated 10 months ago
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- Array-Inspired Pipeline Language☆119Updated last year
- Tool to create a dataset of semantic segmentation on website screenshots from their DOM☆89Updated 2 years ago
- Inference code for LLaMA models☆188Updated 2 years ago
- hnsqlite integrates hnswlib and sqlite for simple text embedding search☆161Updated 2 years ago
- Demos utilizing the ChatGPT API☆94Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- ☆252Updated 2 years ago
- Stop messing around with finicky sampling parameters and just use DRµGS!☆349Updated last year
- An implementation of bucketMul LLM inference☆220Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆382Updated last year
- Some of the scripts I use for scribepod @ https://scribepod.substack.com/, an automated AI podcast☆171Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated 2 years ago
- Run inference on replit-3B code instruct model using CPU☆156Updated 2 years ago