remixer-dec / llama-mps
Experimental fork of Facebook's LLaMA model that runs with GPU acceleration on Apple Silicon (M1/M2)
☆87 Updated last year
Alternatives and similar repositories for llama-mps:
Users who are interested in llama-mps are comparing it to the libraries listed below
- Tiny inference-only implementation of LLaMA ☆91 Updated 9 months ago
- LLM plugin for running models using MLC ☆182 Updated 10 months ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆135 Updated last year
- A library for incremental loading of large PyTorch checkpoints ☆56 Updated last year
- Demos utilizing the ChatGPT API ☆95 Updated last year
- WebGPU LLM inference tuned by hand ☆148 Updated last year
- ☆40 Updated last year
- Embedding models from Jina AI ☆57 Updated last year
- Visualize text embeddings ☆34 Updated last year
- A web-app to explore topics using LLM (less typing and more clicks) ☆66 Updated last year
- ☆136 Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin ☆324 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- Let's create synthetic textbooks together :) ☆73 Updated last year
- Replace expensive LLM calls with finetunes automatically ☆62 Updated 11 months ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences. ☆153 Updated last year
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆47 Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆111 Updated last year
- Array-Inspired Pipeline Language ☆119 Updated last year
- GPT-2 small trained on phi-like data ☆65 Updated 11 months ago
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 4 months ago
- Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) wor… ☆210 Updated last year
- Revealing example of self-attention, the building block of transformer AI models ☆130 Updated last year
- ☆38 Updated 10 months ago
- Command-line script for inferencing from models such as falcon-7b-instruct ☆76 Updated last year
- GPT-3 on your command line ☆131 Updated last year
- Use context-free grammars with an LLM ☆167 Updated 10 months ago
- Python notebook to run OpenAI's Whisper model with speaker identification ☆80 Updated 2 years ago