venuatu / llama

Inference code for LLaMA models

☆46

Alternatives and similar repositories for llama

Users who are interested in llama are comparing it to the repositories listed below.

shawwn / llama
Inference code for LLaMA models
☆188 Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆146 Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123 Updated 2 years ago
modular-ml / wrapyfi-examples_llama
Inference code for facebook LLaMA models with Wrapyfi support
☆129 Updated 2 years ago
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆118 Updated last year
zphang / minimal-llama
☆457 Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆534 Updated last year
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆108 Updated 2 years ago
aspctu / alpaca-lora
Instruct-tuning LLaMA on consumer hardware
☆65 Updated 2 years ago
bjoernpl / llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
☆48 Updated 2 years ago
s4rduk4r / alpaca_lora_4bit_readme
Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit
☆31 Updated 2 years ago
AlpinDale / sparsegpt-for-LLaMA
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆70 Updated 2 years ago
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆323 Updated 2 years ago
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121 Updated 2 years ago
replicate / cog_stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆110 Updated 2 years ago
cedrickchee / llama
Inference code for LLaMA 2 models
☆30 Updated last year
harrisonvanderbyl / rwkv_chatbot
rwkv_chatbot
☆61 Updated 2 years ago
mayank31398 / GPTQ-for-SantaCoder
4-bit quantization of SantaCoder using GPTQ
☆50 Updated 2 years ago
NolanoOrg / cformers
SoTA Transformers with C-backend for fast inference on your CPU.
☆308 Updated last year
skeskinen / llama-lite
Embeddings focused small version of Llama NLP model
☆105 Updated 2 years ago
pointnetwork / point-alpaca
☆403 Updated 2 years ago
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆313 Updated last year
AmericanPresidentJimmyCarter / yal-discord-bot
Yet Another LLaMA/ALPACA Discord Bot
☆69 Updated 2 years ago
KohakuBlueleaf / guanaco-lora
Instruct-tune LLaMA on consumer hardware
☆73 Updated 2 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412 Updated 2 years ago
oobabooga / GPTQ-for-LLaMa
4-bit quantization of LLaMa using GPTQ
☆130 Updated 2 years ago
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆247 Updated last year
clcarwin / alpaca-weight
Train LLaMA with LoRA on a single 4090 and merge the LoRA weights to work as Stanford Alpaca.
☆52 Updated 2 years ago
thomasantony / llamacpp-python
Python bindings for llama.cpp
☆198 Updated 2 years ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆99 Updated 2 years ago