remixer-dec / llama-mpsLinks

Experimental fork of Facebooks LLaMa model which runs it with GPU acceleration on Apple Silicon M1/M2

☆86

Alternatives and similar repositories for llama-mps

Users that are interested in llama-mps are comparing it to the libraries listed below

Sorting:

simonw / llm-mlc
LLM plugin for running models using MLC
☆188Updated last year
recmo / cria
Tiny inference-only implementation of LLaMA
☆93Updated last year
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆324Updated 2 years ago
simulatrex / simulatrex-engine
Enable decision-making based on simulations
☆227Updated last year
jayhack / llm.sh
GPT-3 on your command line
☆131Updated 2 years ago
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆152Updated last year
refcell / run-wild
Augment GPT-4 Environment Access
☆285Updated 2 years ago
Hellisotherpeople / Constrained-Text-Generation-Studio
Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) wor…
☆213Updated last year
catid / supercharger
Supercharge Open-Source AI Models
☆350Updated 2 years ago
IntrinsicLabsAI / gbnfgen
TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces
☆137Updated last year
automorphic-ai / trex
Enforce structured output from LLMs 100% of the time
☆249Updated 11 months ago
jostmey / NakedAttention
Revealing example of self-attention, the building block of transformer AI models
☆131Updated 2 years ago
charstorm / llmbinge
A web-app to explore topics using LLM (less typing and more clicks)
☆67Updated last year
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆118Updated 10 months ago
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆151Updated 2 years ago
saulpw / aipl
Array-Inspired Pipeline Language
☆119Updated last year
dmvaldman / html_semantic_seg
Tool to create a dataset of semantic segmentation on website screenshots from their DOM
☆89Updated 2 years ago
shawwn / llama
Inference code for LLaMA models
☆188Updated 2 years ago
jiggy-ai / hnsqlite
hnsqlite integrates hnswlib and sqlite for simple text embedding search
☆161Updated 2 years ago
minimaxir / chatgpt_api_test
Demos utilizing the ChatGPT API
☆94Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
andyk / recursive_llm
Implement recursion using English as the programming language and an LLM as the runtime.
☆238Updated 2 years ago
Futrell / ziplm
☆252Updated 2 years ago
EGjoni / DRUGS
Stop messing around with finicky sampling parameters and just use DRµGS!
☆349Updated last year
kolinko / effort
An implementation of bucketMul LLM inference
☆220Updated last year
OneInterface / realtime-bakllava
llama.cpp with BakLLaVA model describes what does it see
☆382Updated last year
yacineMTB / scribepod
Some of the scripts I use for scribepod @ https://scribepod.substack.com/, an automated AI podcast
☆171Updated 2 years ago
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆75Updated 2 years ago
abacaj / replit-3B-inference
Run inference on replit-3B code instruct model using CPU
☆156Updated 2 years ago