Ronsor / llama-tools

Tools for the LLaMA language model

☆12

Alternatives and similar repositories for llama-tools

Users who are interested in llama-tools are comparing it to the repositories listed below.

karim-aloulou / Espitchatbot-RASA-RAVEN
Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven
☆13 Updated 2 years ago
lachlansneff / sparsellama
☆40 Updated 2 years ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31 Updated last year
kyegomez / Andromeda
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
☆151 Updated 10 months ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆100 Updated last year
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆75 Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123 Updated 2 years ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆173 Updated last year
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177 Updated last week
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆101 Updated 2 years ago
AlpinDale / LLM-Shearing
Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆28 Updated last year
abetlen / program-constrained-language-model-sampling
☆35 Updated 2 years ago
kafischer / AIMindFlow
The first AI artist
☆32 Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94 Updated last year
bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆25 Updated last year
SLAM-group / newhope
☆22 Updated last year
AlpinDale / sparsegpt-for-LLaMA
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71 Updated 2 years ago
s4rduk4r / alpaca_lora_4bit_readme
Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit
☆31 Updated 2 years ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆105 Updated last year
rmihaylov / mpttune
Tune MPTs
☆84 Updated 2 years ago
devbrones / llama-prompts
A collection of prompts for Llama
☆100 Updated 2 years ago
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆66 Updated last year
Hellisotherpeople / llm_steer-oobabooga
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆43 Updated last year
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121 Updated 2 years ago
togethercomputer / redpajama.cpp
Extends the original llama.cpp repo to support the RedPajama model.
☆118 Updated 10 months ago
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14 Updated 2 years ago
catid / oaillama3
Simple setup to self-host LLaMA3-70B model with an OpenAI API
☆19 Updated last year
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30 Updated 2 years ago
teknium1 / stanford_alpaca-replit
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆41 Updated 2 years ago