remixer-dec / llama-mpsLinks
Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2
☆86Updated 2 years ago
Alternatives and similar repositories for llama-mps
Users that are interested in llama-mps are comparing it to the libraries listed below
Sorting:
- LLM plugin for running models using MLC☆191Updated last year
- Tiny inference-only implementation of LLaMA☆92Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Augment GPT-4 Environment Access☆285Updated 2 years ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Updated 2 years ago
- Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) wor…☆213Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- GPT-3 on your command line☆130Updated 2 years ago
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- Supercharge Open-Source AI Models☆349Updated 2 years ago
- ☆255Updated 2 years ago
- Enforce structured output from LLMs 100% of the time☆250Updated last year
- Array-Inspired Pipeline Language☆120Updated 2 years ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆141Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware☆362Updated 2 years ago
- An implementation of bucketMul LLM inference☆223Updated last year
- Stop messing around with finicky sampling parameters and just use DRµGS!☆360Updated last year
- Inference code for LLaMA models☆189Updated 2 years ago
- Finetune llama2-70b and codellama on MacBook Air without quantization☆450Updated last year
- ☆135Updated 2 years ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆158Updated 2 years ago
- JS tokenizer for LLaMA 1 and 2☆362Updated last year
- Tensor library for machine learning☆274Updated 2 years ago
- Demos utilizing the ChatGPT API☆94Updated 2 years ago
- Enable decision-making based on simulations☆231Updated last year
- hnsqlite integrates hnswlib and sqlite for simple text embedding search☆160Updated 2 years ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆239Updated 2 years ago
- Text generator prompting with Boolean operators☆181Updated last month